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Abstract 

The method of alternating projections (MAP) is a common method for solving feasibility prob- 
lems. While employed traditionally to subspaces or to convex sets, little was known about 
the behavior of the MAP in the nonconvex case until 2009, when Lewis, Luke, and Malick de- 
rived local linear convergence results provided that a condition involving normal cones holds 
and at least one of the sets is superregular (a property less restrictive than convexity). How- 
ever, their results failed to capture very simple classical convex instances such as two lines in 
three-dimensional space. 

In this paper, we extend and develop the Lewis-Luke-Malick framework so that not only 
any two linear subspaces but also any two closed convex sets whose relative interiors meet 
are covered. We also allow for sets that are more structured such as unions of convex sets. 
The key tool required is the restricted normal cone, which is a generalization of the classical 
Mordukhovich normal cone. We thoroughly study restricted normal cones from the viewpoint 
of constraint qualifications and regularity. Numerous examples are provided to illustrate the 
theory. 
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(1) X is a Euclidean space 



(i.e., finite-dimensional real Hilbert space) with inner product (•,•), induced norm || • ||, and in- 
duced metric d. 

Let A and B be nonempty closed subsets of X. We assume first that A and B are additionally 
convex and that A n B ^ 0. In this case, the projection operators Pa and Pg (a.k.a. projectors or 
nearest point mappings) corresponding to A and B, respectively, are single-valued with full do- 
main. In order to find a point in the intersection A and B, it is very natural to simply alternate the 
operator Pa and Pb resulting in the famous method of alternating projections (MAP). Thus, given a 
starting point b_i G X, sequences (fl«) n GN an d (&n)n€N are generated as follows: 

(2) (Vn G N) a n := P A K-i, K ■= P B a n - 

In the present consistent convex setting, both sequences have a common limit in AC\B. Not 
surprisingly, because of its elegance and usefulness, the MAP has attracted many famous math- 
ematicians, including John von Neumann and Norbert Wiener and it has been independently 
rediscovered repeatedly. It is out of scope of this article to review the history of the MAP, its many 
extensions, and its rich and convergence theory; the interested reader is referred to, e.g., [4], [7], 
[11], and the references therein. 

Since X is finite-dimensional and A and B are closed, the convexity of A and B is actually not 
needed in order to guarantee existence of nearest points. This gives rise to set-valued projection op- 
erators which for convenience we also denote by Pa and Pg. Dropping the convexity assumption, 
the MAP now generates sequences via 

(3) (Vn G N) a n G P A b n—l/ b n G P B a„. 

This iteration is much less understood than its much older convex cousin. For instance, global 
convergence to a point in A n B cannot be guaranteed anymore [9]. Nonetheless, the MAP is 
widely applied to applications in engineering and the physical sciences for finding a point in 
A n B (see, e.g., [25]). Lewis, Luke, and Malick achieved a break-through result in 2009, when 
there are no normal vectors that are opposite and at least one of the sets is superregular (a property 
less restrictive than convexity). Their proof techniques were quite different from the well known 
convex approaches; in fact, the Mordukhovich normal cone was a central tool in their analysis. 
However, their results were not strong enough to handle well known convex and linear scenarios. 
For instance, the linear convergence of the MAP for two lines in K 3 cannot be obtained in their 
framework. 

The goal of this paper is to extend the results by Lewis, Luke and Malick to make them applicable in 
more general settings. We unify their theory with classical convex convergence results. Our principal tool 
is a new normal cone which we term the restricted normal cone. A careful study of restricted normal 
cones and their applications is carried out. We also allow for constraint sets that are unions of superregular 



2 



(or even convex) sets. We shall recover the known optimal convergence rate for the MAP when studying 
two linear subspaces. In a parallel paper [5] we apply the tools developed here to the important 
problem of sparsity optimization with affine constraints. 

The remainder of the paper is organized as follows. In Section 2, we collect various auxiliary 
results that are useful later and to make the later analysis less cluttered. The restricted normal 
cones are introduced in Section 3. Section 4 focuses on normal cones that are restricted by affine 
subspaces; the results achieved are critical in the inclusion of convex settings to the linear conver- 
gence framework. Further examples and results are provided in Section 5 and Section 6, where we 
illustrate that the restricted normal cone cannot be obtained by intersections with various natural 
conical supersets. Section 7 and Section 8 are devoted to constraint qualifications which describe 
how well the sets A and B relate to each other. In Section 9, we discuss regularity and superreg- 
ularity notions that extend the idea of convexity, for sets and collections of sets. We are then in a 
position to provide in Section 10 our main results dealing with the local linear convergence of the 
MAP. 



Notation 

The notation employed in this article is quite standard and follows largely [6], [22], [23], and 
[24]; these books also provide exhaustive information on variational analysis. The real num- 
bers are R, the integers are Z, and N := {z G Z | z > 0}. Further, R + := {x G R | x > 0}, 

R ++ := {x G R | x> 0} and R and R are defined analogously. Let R and S be subsets 

of X. Then the closure of S is S, the interior of S is int(S), the boundary of S is bdry(S), and 
the smallest affine and linear subspaces containing S are aff S and span S, respectively. The lin- 
ear subspace parallel to aff S is parS := (aff S) — S = (aff S) — s, for every s G S. The rela- 
tive interior of S, ri(S), is the interior of S relative to aff(S). The negative polar cone of S is 
S© = {u G X | sup (u, S) < 0}. We also set S® := -S® and S x := S® n S®. We also write R S 
for R + S := {r + s | (r,s) G R x S} provided that R _L S, i.e., (V(r,s) G R x S) (r,s) = 0. We 
write F : X =t X, if F is a mapping from X to its power set, i.e., gr F, the graph of F, lies in X x X. 
Abusing notation slightly, we will write F(x) = y if F(x) = {y}. A nonempty subset X of X is a 
cone if (VA G R+) AX :={AA;|A;GX}CX. The smallest cone containing S is denoted cone(S); 
thus, cone(S) := R + • S := {ps | p G R+,s G S} if S ^ and cone(0) := {0}. The smallest con- 
vex and closed and convex subset containing S are conv(S) and conv (S), respectively If z G X 
and p G R++, thenball(z;jo) := {x G X | d(z,x) < p] is the closed ball centered atz with radius p 
while sphere(z; p) := {x G X | d(z, x) = p] is the (closed) sphere centered at z with radius p. If u 
and v are in X, then [u, v] := { (1 — A)u + Av | A G [0, 1] } is the line segment connecting u and v. 

2 Auxiliary results 

In this section, we fix some basic notation used throughout this article. We also collect several 
auxiliary results that will be useful in the sequel. 
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Projections 



Definition 2.1 (distance and projection) Let Abe a nonempty subset ofX. Then 

(4) d A : X -> R: x i-4 inf d(x,a) 

a<=A 

is the distance function of the set A and 

(5) P A : X =4 X: x i-> [a G A \ d A (x) = d(x,a)} 
is the corresponding projection. 

Proposition 2.2 (existence) Let A be a nonempty closed subset ofX. Then (Vx G X) Pa{ x ) ^ - 

Proof. Letz G X. The function / : X->K:i^ ||x — z|| 2 is continuous and hm^ x ^ +00 f(x) = +oo. 
Let (x n ) nG N be a sequence in A such that f(x n ) -4 inf /(A). Then (x n ) ne jsj is bounded. Since A is 
closed and / is continuous, every cluster point of (x n ) ne N is a minimizer of / over the set A, i.e., 
an element in P A z. ■ 

Example 2.3 (sphere) Let z G X and p G K.++. Set S := sphere(z;p). Then 

(6) (VxGX) P s (x) = ( Z + ^' ifx ^ Z; 

[ S, otherwise. 

Proof. Let x G X. The formula is clear when x = z, so we assume x ^ z. Set 

(7) c := z + jO . X ~ Z M G S, 

||x — z|| 

and let s = z + pb G S \ {c}, i.e., ||fc|| = 1 and b ^ (x — z)/||x — z||. Hence, using that |||w|j — 
||z?||| < || m — v\\ (u,v) < and because of Cauchy-Schwarz, we obtain 

(8a) || x — c\\ = | ||x — z|| — p\ = | ||x — z|| — |||Ofc||| = |||x — z|| — ||s — z||| 

(8b) < ||x-s||. 

We have thus established (6). ■ 

In view of Proposition 2.2, the next result is in particular applicable to the union of finitely many 
nonempty closed subsets of X. 

Lemma 2.4 (union) Let (AOiez be a collection of nonempty subsets of X, set A := |J 1GJ A[, let x G X, 
and suppose that a G Pa( x )- Then there exists i G I such that a G Pa ; (*)- 

Proof. Indeed, since a £ A, there exists i G J such that a G A,. Then d(x,a) = dA(x) < dA^x) < 
d(x,a). Hence d(x,a) = d Ai (x), as claimed. ■ 

The following result is well known. 
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Fact 2.5 (projection onto closed convex set) Let Cbea nonempty closed convex subset ofX, and let x, 
y and p be in X. Then the following hold: 

(i) Pc(x) is a singleton. 

(ii) Pc(x) = p if and only if p EC and sup (C — p,x — p) < 0. 

(iii) \\p c (x) -Pc(y)\\ 2 + II (id -p c )(x) - (id-p c )(3/)|| 2 < lk-yll 2 - 

(iv) \\P c (x) - P c (y)\\ < \\x-y\\. 

Proof. (i)&(ii): [4, Theorem 3.14]. (iii): [4, Proposition 4.8]. (iv): Clear from (iii). ■ 

Miscellany 

Lemma 2.6 Let A and B be subsets ofX, and let Kbea cone in X. Then the following hold: 

(i) cone(A n B) C cone A n cone B. 

(ii) cone(K n B) = K n cone B. 

Proof, (i): Clear, (ii): By (i), cone(i<C n B) C (cone K) n (cone B) = XRconeB. Now assume that 
x G (X n cone B) \ {0}. Then there exists 8 > such that i//J 6 B. Since K is a cone, x/6 E K. 
Thus x/fi G X n B and therefore x G cone(X n B). ■ 

Note that the inclusion in Lemma 2.6(i) may be strict: indeed, consider the case when X = R, 
A := {1}, and B = {2}. 

Lemma 2.7 (a characterization of convexity) Let A be a nonempty closed subset ofX. Then the follow- 
ing are equivalent: 

(i) A is convex. 

(ii) P^" 1 (fl) — a is a cone, for every a G A. 

(iii) Pa(x) is a singleton, for every i£l 

Proof. "(i)=>(ii)": Indeed, it is well known in convex analysis (see, e.g., [24, Proposition 6.17]) that 
for every a G A, P A l (a) — a is equal to the normal cone (in the sense of convex analysis) of A at a. 

"(ii)=>(iii)": Let x G X. By Proposition 2.2, P A x ^ 0. Take a\ and a^ in P A x. Then \\x — a\ || = 
||x — a?\\ and x — a\ G P A 2 «i — fli. Since P A X « — fl is a cone, we have 2(x — a\) G P A X fli — «i- Hence 
y := 2x — fli G P^ai and y — x = x — a\. Thus, 

(9a) (y - a 2 , a x - a 2 ) = ((y - x) + (x - a 2 ), («i - x) + (x - b 2 )> 
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(9b) = (y — x,a\ — x) + (y — x, x — aj) + (x — a2,a\ — x) + \\x — a2\Y 

(9c) = (x — a\, a\ — x) + (x — a\, x — a<i) + (x — ci2, a\ — x) + \\ x — a-i 

(9d) = — ||x — fli|| 2 + \\x — fl2|| 2 

(9e) = 0. 

Since a\ € P A y, it follows that 

(10a) ||y — a\ || 2 = \y — + 2 (y — fl2/«2 — «i) + ||«i — «2|| 2 

(10b) = \\y — fl2|| 2 + ||«i — «2|| 2 

(ioc) >l!y-«2|| 2 

(10d) ^Hy-flill 2 . 



Hence equality holds throughout (10). Therefore, a\ = a^. 

"(iii)=>(i)": This classical result due to Bunt and to Motzkin on the convexity of Chebyshev sets 
is well known; for proofs, see, e.g., [11, Chapter 12] or [4, Corollary 21.13]. ■ 

Proposition 2.8 Let Shea convex set. Then the following are equivalent. 

(i) G riS. 

(ii) cone S = span S. 

(iii) coneS = spanS. 

Proof. Set Y = span S. Then (i) belongs to the interior of S relative to Y. 

"(i)=Kii)": There exists 5 > such that for every y G Y \ {0}, <5y/||y|| G S. Hence y G cone S. 

"(ii)=>(i)": For every y G Y, there exists £ > such that 5y G S. Now [23, Corollary 6.4.1] applies 
inY. 

"(ii)<^>(iii)" : Set K = coneS, which is convex. By [23, Corollary 6.3.1], we have riX = ri Y <^> 
I=yoriYCKCy. Since ri Y = Y = Y, we obtain the equivalences: riK = Y O X = Y <^> 
X = Y. ■ 



3 Restricted normal cones: basic properties 

Normal cones are fundamental objects in variational analysis; they are used to construct subd- 
ifferential operators, and they have found many applications in optimization, optimal control, 
nonlinear analysis, convex analysis, etc.; see, e.g., [4], [6], [8], [19], [22], [23], [24]. One of the key 
building blocks is the Mordukhovich (or limiting) normal cone Na, which is obtained by limits 
of proximal normal vectors. In this section, we propose a new, very flexible, normal cone of A, 
denoted by N^, by constraining the proximal normal vectors to a set B. 
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Definition 3.1 (normal cones) Let A and B be nonempty subsets ofX, and let a and u be in X. If a G A, 

then various normal cones of A at a are defined as follows: 

(i) The B-restricted proximal normal cone of A at a is 

(11) Nl(a) := cone ((B nP A 1 a) - a) = cone((B-a) n(P A 1 a-a)). 

(ii) The (classical) proximal normal cone of A at a is 

(12) N p A mx (a) := N$ (a) = cone (P A l a - a) . 

(iii) The B-restricted normal cone N A (a) is implicitly defined by u G N A (a) if and only if there exist 
sequences (fl /1 )uGN A and (u n ) n ^ in N A (fl n ) such that a n — > a and w n -4 M. 

(iv) The Frechet normal cone N A re (a) is implicitly defined by u G N A re (a) zf and onZy if (Ve > 0) 
(3<J > 0) (Vx G Anball(a;£)) (u,x-a) < e||x-«||. 

(v) The normal convex from convex analysis N A nv (a) is implicitly defined by u G N A onv (a) if and 
only if sup (u, A — a) < 0. 

(vi) 77/e Mordukhovich normal cone N^(fl) o/A at a is implicitly defined by u G Na(a) if and onZy if 
f/iere exz'sf sequences (a n ) ne ^ in A and (Wn)neN m N A r0X (a n ) suc/z z7zaf a„ — > a and u n — > zz. 

If a £ A, then all normal cones are defined to be empty. 




Remark 3.2 Some comments regarding Definition 3.1 are in order. 

(i) Clearly, the restricted proximal normal cone generalizes the notion of the classical proximal 
normal cone. The name "restricted" stems from the fact that the pre-image P A 1 a is restricted 
to the set B. 
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(ii) See [24, Example 6.16] and [22, Subsection 2.5.2.D on page 240] for further information re- 
garding the classical proximal normal cone, including the fact that 

(13) « 6 Nf » O a G A and (3 3 > 0)(Vx G A) (u, x - a) < 5\\x - a\\ 2 . 
This also implies that: N p ™ x (a) + (A-a) e C N p ™ x (a). 

(iii) Note that grN^ = (A x X) n grNjj. Put differently, N%(a) is the outer (or upper Kura- 
towski) limit of (x) as x — > a in A, written 

(14) N» =BmN2(*). 
See also [24, Chapter 4]. 

(iv) See [22, Definition 1.1] or [24, Definition 6.3] (where this is called the regular normal cone) 
for further information regarding N^ re (a). 

(v) The Mordukhovich normal cone is also known as the basic or limiting normal cone. Note 
that N A = N% and grN A = (AxX)ngrN^ = (A x X) n grN^ rox and once again N A (a) 
is the outer (or upper Kuratowski) limit of N*{x) or N^° x (x) as x —> a in A. See also [22, 
page 141] for historical notes. 

The next result presents useful characterizations of the Mordukhovich normal cone. 

Proposition 3.3 (characterizations of the Mordukhovich normal cone) Let A be a nonempty closed 
subset ofX, let a G A, and let u G X. Then the following are equivalent: 

(i) u G N A {a). 

(ii) There exist sequences (A„)„ GN in K.+, (fc n )neN i n X, {a„) ne -^ in A such that a n — > a, A n (b n — 
a n ) —> u, and (Vn G N) fl„ G PaV 

(iii) Tnere exist sequences (A n ) nG]N m R + , (x„)„ € ]n in X, (fl n ) n€ ]N zn A such that x n —> a, A n (x„ — 
a„) —> u, and (Vn G N) « n G Pax«. (T/zz's flZso implies a n — > flj 

(iv) T/zere exzsf sequences (fl n )neN in A and (n n )neN in X such that a n —> a, u n —> u, and (Vn G N) 
U n G N^(« n ). 

Proof, "(i)o(ii)": Clear from Definition 3.1(vi). 

"(iii)o(iv)": Noting that the definition of N A (a) in [22] is the one given in (iv), we see that this 
equivalence follows from [22, Theorem 1.6]. 

"(ii)^(iii)": Let (A n ) n6N/ (a n ) neK , and (b n ) neN be as in (ii). For every n G N, since a n G P A b n , 
[24, Example 6.16] implies that a n G P A [a„,b n ]. Now let (£ n )neiN be a sequence in ]0, 1[ such that 
e„a„ —> and e„b„ —> 0. Set 

(15) (Vn G N) x n = (1 -£„)«„ + e n b n = a n + £„(b n - a n ) G [a n ,b n \. 
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Then x n — > a and (Vn G N) a n G Pax„- Furthermore, (A n /e n ) n€ isj lies in R + and 
(16) (A n /e n )(x n - a n ) = A n (b n - a n ) -> u. 



"(iii)=>(ii)": Let (\ n )neNt (x n )neN/ and (fl„)„ £ N be as in (iii). Since x n — > a and a G A, we 
deduce that < ||x n — a„|| = dA(x n ) < \\x n — a\\ —> 0. Hence x„ — a n — > which implies that 
fl n — a = a„ — x n + x n — a —> + = 0. Therefore, (ii) holds with (fc n )neN = (*n)ne]N- * 

Here are some basic properties of the restricted normal cone and its relation to various classical 
cones. 

Lemma 3.4 (basic inclusions among the normal cones) Let A and B be nonempty subsets ofX, and 
let a G A. Then the following hold: 

(i) N^(a)CNl TOX (a). 

(ii) N|(a) = cone((B - a) n (P^a - a)) C (cone(B - a)) n NP rox (a). 

(iii) N|(a) C (a) = N p A rm {a) and N B A {a) C N A (a). 

(iv) N*(a) CN»(«). 

(v) 7/ A is dosed, f/zen NP rox (a) C N™(a). 

(vi) 7/ A is c/osed, then N%*(a) C N A (a). 

(vii) If A is closed and convex, then N|(a) = N prm {a) = N™(a) = N c A om {a) = N A {a). 

(viii) If a G ri(A), then Nf [A \a) = Nf {A \a) = {0}. 

(ix) (aff(A) -a) 1 C (A-a) e . 

(x) (A-fl) e ncone(B-fl) C N^(fl) C cone(B-a). 

Proof, (i): Take u G N A om » and fix an arbitrary 8 > 0. Then (Vx G A) (u, x - a) < < 5 \\x - a\\ 2 . 
In view of (13), u G N p A m {a). 

(ii) : In view of Lemma 2.6, the definitions yield 

(17a) N^(fl) = cone ((B n P^fl) - a) = cone ((£> - a) n (P A x a - a)) 

(17b) C cone ((B - a) n cone(P A 1 a - a)) = cone ((B - a) n NP rox (a)) 

(17c) =cone(B-fl)nNP rox (fl). 

(iii) , (iv) and (ix): This is obvious. 
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(v) : Assume that A is closed and take u G N A ° x (a). By (13), there exists p > such that 
(Vx G A) (u,x-a) < p\\x-a\\ 2 . Now let e > and set 5 = e/p. If x G A n ball(a; J), then 
(u,x — a) < p\\x — a\\ 2 < pS\\x — a\\ = e\\x — a\\. Thus, w G N^ re (a). 

(vi) : This follows from Proposition 3.3. 

(vii) : Since A is closed, it follows from (i), (v), and (vi) that 



On the other hand, by [22, Proposition 1.5], N A (a) Q N A onv (a) because A is convex. 

(viii): By assumption, (3 5 > 0) ba\\(a;5) n aff(A) C A. Hence aff(A) n P A x a = {a} and thus 
Nf {A) (a) = {0}. Since a G ri(A), it follows that (Vx G baU(a;<5/2) n aflf(A)) N^ ff(A) (x) = {0}. 
Therefore, Nf (A) (fl) = {0}. 

(x): Take u £ ((A — a) e n cone(B — a)) n {0}, say w = A(b — a), where b G B and A > 0. Then 
> sup ( A — a, u) = A sup (A — a, b — a) = sup A (conv A — a, b — a). By Fact 2.5(ii), a = PconvAb 
and hence a = P A b. It follows that u G cone((B DP^a) — a). The left inclusion thus holds. The 
right inclusion is clear. ■ 

Remark 3.5 (on closedness of normal cones) Let A be a nonempty subset of X, let a € A, and 

let B be a subset of X. Then N%(a), N A (a), and N c A onv (a) are obviously closed — this is also true 
for N^(a) but requires some work (see [24, Proposition 6.5]). On the other hand, the classical 
proximal normal cone N^ rox (fl) = N^(a) is not necessarily closed (see, e.g., [24, page 213]), and 
hence neither is N A (a). For a concrete example, suppose that X = R 2 , that A = {(0,0)}, that 
B = R x {1} and that a = (0,0). Then N%(a) = (R x R ++ ) U {(0,0)}, which is not closed; 
however, the classical proximal normal cone N^ rox (fl) = R 2 is closed. 

The sphere is a nonconvex set for which all classical normal cones coincide: 

Example 3.6 (classical normal cones of the sphere) Let z G X and p G R++. Set S := sphere(z; p) 
and let s G S. Then Nf ox (s) = Nf(s) = N F s r€ (s) = N s (s) = R(s - z). 

Proof. By Example 2.3, we have Pg 1 (s) = z + R+(s — z) and so Pg 1 (s) — s = [—1, +oo[ • (s — z). 
Hence, using Lemma 3.4(v)&(vi), we have 



(18) 



Nr v («) C N p A mx (a) C N™(a) C N A (a). 



(19a) 
(19b) 



Nf OX (s)=N s x (s)=R(s 
= Hm< 0X (s / ) 



z)CN™(s)CN s (s) 



limR(s'-z) =R(s-z) 



s'gS 



(19c) 



= Nf ox (s), 



as announced. 



Here are some elementary yet useful calculus rules. 
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Proposition 3.7 Let A, A\, A2, B, B\, and B2 be nonempty subsets of X, let c <E X, and suppose that 
a G A n Ai n A2. Then the following hold: 

(i) If A and B are convex, then N A (a) is convex. 

(ii) N^ lUB2 (fl) = N Bl (a) U N B A 2 (a) and N B A llJB2 (a) = N Bi (a) U N B A 2 (a). 

(iii) IfB C A, then N^(a) = N%{a) = {0}. 

(iv) If A! C A 2 , then N B 2 (a) C N^(«). 

(v) -N^(a) = NZ B A {-a), -N B (a) = NZ B A {-a), and -N A (a) = N_ A (-a). 

(vi) N B (a) = N B Z c c {a - c) and N B (a) = N B Z c c {a - c). 

Proof. It suffices to establish the conclusions for the restricted proximal normal cones since the 
restricted normal cone results follows by taking closures (or outer limits), (i): We assume that 
B fl P A l a 7^ 0, for otherwise the conclusion is clear. Then P A l {a) = P^a = (Id +N A )a is convex 
(as the image of the maximally monotone operator Id +N A at a). Hence (B C\ P^a) — a is convex 
as well, and so is its conical hull, which is N A (a). (ii): Since ((£>i U B2) Pi P^a) — a = {{B\ fl 
P^a) — fl) U ((£>2 n P^a) — a), the result follows by taking the conical hull, (iii): Clear, because 
(£> fl P A l a) — fl is either empty or equal to {0}. (iv): Suppose A(b — a) £ N A (a), where A > 0, 

b G B, and a G P Al b. Since a G A\ C A2, we have a G P Al b. Hence A(b — a) £ N A (a), (v): This 
follows by using elementary manipulations and the fact that P_ A = (— Id) o P A o (— Id), (vi): This 
follows readily from the fact that P A \ c {a — c) = P A 1 (a) — c. ■ 

Remark 3.8 The restricted normal cone counterparts of items (i) and (iv) are false in general; see 
Example 5.1 (and also Example 5.4(iv)) below. 



The Mordukhovich normal cone (and hence also the Clarke normal cone which contains the 
Mordukhovich normal cone) strictly contains {0} at boundary points (see [22, Corollary 2.24] or 
[24, Exercise 6.19]); however, the restricted normal cone can be {0} at boundary points as we 
illustrate next. 

Example 3.9 (restricted normal cone at boundary points) Suppose that X = R 2 , set A := 

ball(0;l) = {x G R 2 | ||x|| < 1} and B:=Ex {2}, and let fl = (fli,fl 2 ) G A. Then 



(20) N» = 
Consequently, 

(21) Nl(fl) = 



R+fl, if ||fl|| = 1 and «2 > 0; 
{ (0, 0) }, otherwise. 



R+fl, if ||fl|| = 1 and «2 > 0; 
{ (0, 0) }, otherwise. 
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Thus the restricted normal cone is {(0,0)} for all boundary points in the lower half disk that do 
not "face" the set B. 

Remark 3.10 In contrast to Example 3.9, we shall see in Corollary 4.11(ii) below that if A is closed, 
B is the affine hull of A, and a belongs to the relative boundary of A, then the restricted normal 
cone N^(fl) strictly contains {0}. 



4 Restricted normal cones and affine subspaces 

In this section, we consider the case when the restricting set is a suitable affine subspace. This 
results in further calculus rules and a characterization of interiority notions. 

The following four lemmas are useful in the derivation of the main results in this section. 

Lemma 4.1 Let A and B be nonempty subsets ofX, and suppose that c G A n B. Then 

(22) aff(AUB)-c = span(B-A). 

Proof. Since c G An B C A U B, it is clear that the aff (A U B) — c is a subspace. On the one 
hand, if a G A and b G B, then b - a = l-b + (-1) • a + 1 • c - c G aff (A U B) - c. Hence 
B - A C aff(A U B) - c and thus span(B - A) C aff (A U B) - c. On the other hand, if x G 
aff (A U B), say x = YLi&i K a i + Eye/ Fjbj, where each a, belongs to A, each by belongs to B, and 
E/G7 A ; + E/e/ /"/ = !' then * - c = Dgz(- a ;)( c - + E;g7 ~ c ) ^ span(B - A). Thus 
aff(AUB) -c C span(B- A). ■ 

Lemma 4.2 Lef A fee a nonempty subset ofX, let a G A, and Zef w G (aff (A) — a)- 1 . T/zen 

(23) (Vac G X) P A {x + u) = P A (x). 

Proof. Let x G X. For every 1)6 A, we have 

(24a) ||w + x-b|| 2 = \\u\\ 2 + 2(u,x -b) + ||x-fc|| 2 

(24b) = \\u\\ 2 + 2{u,x - a) +2{u,a-b) + ||x-fc|| 2 

(24c) = ||«|| 2 + 2(i/,x-fl) + ||x-fc|| 2 . 

Hence Pa{x + u) = argmin foeA \\u + x — £>|| 2 = argmin fcGA ||x — b\\ 2 = P A x, as announced. ■ 
Lemma 4.3 Let A be a nonempty subset ofX, and let L be an affine subspace ofX containing A. Then 
(25) P A = P A o P L . 

Proof. Let a G A and x G X, and set = P L x. Using [4, Corollary 3.20(i)], we have x — b G 
(L - a) 1 C (aff(A) - a)- 1 . In view of Lemma 4.2, we deduce that (P A o P L )x = P A (b) = P A (b + 
(x-b)) = P A x. U 
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Lemma 4.4 Let Abe a nonempty subset ofX, let a <E A, and let L be an affine subspace ofX containing 
A. Then the following hold: 



(i) N L A (a)±(L-a)^. 

(ii) N L A (a)L(L-a)^. 

Proof. Observe that L — a = par(A) does not depend on the concrete choice of a G A. (i): Using 
Lemma 3.4(x), we see that N A (a) C cone(L — a) C span(L — a) _L (span(L — a)) 1 - = (L — a) 1 - — 

(par A)- 1 , (ii): By (i), ranN A C par A. Since ran N A C ran N\, it follows that ran N A C par A = 
L — a. ■ 

For a normal cone restricted to certain affine subspaces, it is possible to derive precise relation- 
ships to the Mordukhovich normal cone. 

Theorem 4.5 (restricted vs Mordukhovich normal cone) Let A and B be nonempty subsets of X, 
suppose that a € A, and let L be an affine subspace ofX containing A. Then the following hold: 

(26a) N%(a) = N%(a) © (L — «) x = N%(a) + (L — a)\ 

(26b) N^(a) = N%(a) n (L - a), 

(26c) N A (a) = N^(fl) (L - fl) x = N A (a) + (L — a)\ 

(26d) N^(fl) = N A (fl) n (L-fl). 

Consequently, the following hold as well: 

(27a) N^(a) = N A ff(A) (fl) © (aff(A) - a) 1 - = N%(a) + (aff(A) - a)\ 

(27b) Nf {A \a) = N^(a) n (aff(A) - a), 

(27c) N A («) = N A ff(A) (fl) © (aff(A) - fl) x = N A (a) + (aff(A) - «)\ 

(27d) N A ff(A) (fl) = N a (a) n (aff(A) - a), 

(27e) a G A n B N A ff(AUB) (fl) = N A (a) n span(A - B). 

Proof. (26a): Take w G N A (a). Then there exist A > 0, x G X, and a G P A x such that A(x — a) = u. 
Set = Pix. By Lemma 4.3, we have a G P A x = (P A o P L )x = P A b. Using [4, Corollary 3.20(i)], we 
thus deduce that A(b - a) G N A (fl) and A(x - b) G (L - fc) x = (L - fl) x . Hence m = A(b - a) + 
A(x - b) G N A («) + (L - fl) x = N A (fl) © (L - a) x by Lemma 4.4(i). We have thus shown that 

(28) N%(a) C N^(fl) © (L-a) x . 

On the other hand, Lemma 3.4(iii) implies that N A (a) C N A (a) and thus 

(29) Aft(fl) + (L-fl) x C N^(<i) + (L-a) x . 
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Altogether, 



(30) N%(a) C N^(fl) © (L - fl) x C N|(fl) + (L — a) x . 

To complete the proof of (26a), it thus suffices to show that N^(a) + (L — a) 1 - C N^(a). To this 
end, let m G N^(a) and u G (L - a) 1 - C (aff(A) - a) ± . Then there exist A > 0, b G X, and a G Pa» 
such that u = A(b - a). If A = 0, then u = and m + i> = u G (aff(A) - a) 1 - C (A - a) e = 
(A — a) e n X = (A — a) e n cone(X - fl) C Nj£(fl) by Lemma 3.4(ix)&(x). Thus, we assume that 
A > 0. By Lemma 4.2, we have a G Pa» = Pa(^ + A _1 y). Hence b + A _1 u — a G N*(a) and 
therefore A(b + A _1 u — a) = A(b — a)+v = u + vE N^(a), as required. 

(26b): By Lemma 3.4(iii)&(x), N\{a) C N£(«) fl (L — a). Now let u G N|(fl) n (L - fl). By 
(26a), we have w = v + w, where v G N\(a) Q L — a and w G (L — a) ± . On the other hand, 
w = u — v G (L — a) — (L — a) = L — a. Altogether w G (L — a) D (L — a) 1 - = {0}. Hence 
u = v G N A (a)- 

(26c): Let u G N A (a). By definition, there exist sequences (fl n )neN i n -A and (m«)«gn i n X such 
that a n —> a, u n —> u, and (Vn G N) u„ G N^(a n ). By (26a), there exists a sequence {v„,w n ) n< =f^ 
such that (fln/^«)nGN lies in grN^, (itf n )neN lies in (L — fl)- 1 , and (Vn G N) u„ = v n + zt>„ and 
Z7 n _L w n . Since ||«|| 2 <— || w„ || 2 = ||p n || 2 + ll^nll 2 / the sequences (v n ) n€ j^ and (w n )n€N are bounded. 
After passing to subsequences and relabeling if necessary, we assume (v n ) n ^ and (w n )n€N are 
convergent, with limits v and w, respectively. It follows that v G N^(a) and w G (L — a)- 1 ; 
consequently, u = v + w E N\{a) ® (L - a) L by Lemma 4.4(h). Thus N A (a) C N^(fl) © (L - a)- 1 . 
On the other hand, by Lemma 3.4(iii), N^(a) © (L - fl) 1 - C N^(fl) + (L - a) L . Altogether, 

(31) N A (a) C N^(fl) © (L - fl) x C N A (a) + (L — fl) x . 

It thus suffices to prove that N A (a) + (L — a) 1 - C N A (a). To this end, take w G N A (a) and i> G 
(L — fl)- 1 . Then there exist sequences (fl n )neN in A and (w n )neN in X such that a n — > a, w„ — > u, 
and (Vn G N) M„ G N*(fl n ). For every n G N, we have L — a = L — a n and hence £ 
N^(a n ) + (L — An) 1 - = N^(a n ) by (26a). Passing to the limit, we conclude that u-\- v E N A (a). 

(26d): First, take u G N^(fl). On the one hand, by Lemma 3.4(iii), u G N A (a). On the other 
hand, by Lemma 4.4(h), u G (L — a) -11 - = L — a. Altogether, we have shown that 

(32) N^(a) C N A (a) n (L-fl). 

Conversely, take w G N A (a) D (L — a) C N A (a). By (26c), there exist c G N^(a) and w G (L — fl) 1 - 
such that w = i; + if and c _L if . By (32), v E L — a. Hence w = u — v E (L — a) — (L — a) = 
L — a. Since w E (L — a) ± , we deduce that w = 0. This implies u = r? G N^(fl). Therefore, 
N^(fl) n (L-fl) C N^(fl). 

"Consequently" part: Consider (26) when L = aff(A) or L = aff (A U B), and recall Lemma 4.1 
in the latter case. ■ 

An immediate consequence of Theorem 4.5 (or of the definitions) is the following result. 
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Corollary 4.6 (the X-restricted and the Mordukhovich normal cone coincide) Let A be a nonempty 
subset ofX, and let a € A. Then 

(33) N$(a)=N A (a). 

The next two results provide some useful calculus rules. 

Corollary 4.7 (restricted normal cone of a sum) Let C\ and C2 be nonempty closed convex subsets of 
X, let a\ G C\, let a 2 G C 2 , and let L be an affine subspace ofX containing C\ + C2. Then 

(34) Nt 1+C2 (ai + a 2 ) = N^(a t ) n N^(a 2 ). 

Proof. Set C = Ci + C2 and a = a\ + a 2 . Then (26d) and [24, Exercise 6.44] yield 

(35a) N£(a) = N c (a) n (L - a) = N Ci (fli) n N C2 (fl 2 ) H (L — a) 

(35b) = (N Cl («i)n(L-fl)) n (N C2 (a 2 )n(L-fl)). 

Note that L — a is a linear subspace of X containing Ci — a\ and C2 — a 2 . Thus, L — a 2 = L — a + a\ 
is an affine subspace of X containing Q, and L — a\ = L — a + a 2 is an affine subspace of X 
containing C 2 . By (26d), 

(36) N^ fl2 (fli) = N Q (fli) n (L - a) and N^ ai (a 2 ) = N Cl (a 2 ) C\(L — a). 

The conclusion follows by combining (35) and (36). ■ 

Corollary 4.8 (an intersection formula) Let A and B be nonempty closed convex subsets ofX, and sup- 
pose that a € ADB. Let Lbe an affine subspace ofX containing AUB. Then 

(37) N»n(-N B L (fl)) =N L A Z a B (0). 

Proof. Using (26d), Proposition 3.7(v), [24, Exercise 6.44], and again (26d), we obtain 

(38a) N^(a) n ( - N%(a)) = N A (a) n (L - a) n ( - N B (a)) n (L - a) 

(38b) = (N A (fl)n(-N B (fl))) n(L-a) 

(38c) =(N A («)nN_ B (-fl))n(L-fl) 

(38d) =N A _ B (0)n(L-fl) 

(38e) =N^«(0), 

as required. ■ 



Let us now work towards relating the restricted normal cone to the (relative and classical) inte- 
rior and to the boundary of a given set. 
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Proposition 4.9 Let A be a nonempty subset of X, let a G A, let L be an affine subspace containing A, 
and suppose that N^(a) = {0}. Then L = aff(A). 

Proof. Using <E Nf (A) (a) C N^(a) = {0} and applying (26c) and (27c), we have 

(39) N A (a) = + (L — a) L = + (aff(A) - a) L . 

SoL-a = aff(A)-a,i.e., L = aff(A). ■ 
Theorem 4.10 Let A and B be nonempty subsets ofX, and let a G A. Then 

(40) N%(a) = {0} & (33 > 0)(Vx G Anball(a;<J)) P A X {x) HB C {x}. 
Furthermore, if A is closed and B is an affine subspace ofX containing A, then the following are equivalent: 

(i) N B A (a) = {0}. 

(ii) (Bp > 0)ball(a;p) f)B C A. 

(iii) B = aff(A) and a G ri(A). 

Proof. Note that Nj(a) = {0} O (3<5 > 0) (Vx G A nball(a;<$)) Nj(x) = {0}. Hence (40) follows 
from the definition of N^(x). 

Now suppose that A is closed and B is an affine subspace of X containing A. 

"(i)^(ii)": Let 5 > be as in (40) and set p := 5/2. Let G B(a;,o) n B, and take x G which 
is possible since A is closed. Then \\b — x|| = d A (b) < \\b — a\\ < p and hence 

(41) \\x-a\\ < \\x - b\\ + \\b - a\\ < p + p = 2p = 5. 

Using (40), we deduce that b G P A H X ) n B C {x} C A. 

"(ii)^(iii)": It follows that B = aff(B) C aff(A) C B; hence, B = aff(A). Thus baU(a;p) D 
aff(A) C A, which means that a G ri(A). 

"(iii)=>(i)": Lemma 3.4(viii). ■ 

Corollary 4.11 (interior and boundary characterizations) Let A be a nonempty closed subset of X, 
and let a € A. Then the following hold: 

(i) Nf {A) {a) = {0}OflGri(A). 

(ii) Nf {A \a) £ {0} e> a G A \ ri(A). 

(iii) N A (fl) = {0} O a G int(A). 
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(iv) N A (a) /{0}OiiGA\ int(A). 

Proof, (i): Apply Theorem 4.10 with B = aff(A). (ii): Clear from (i). (iii): Apply Theorem 4.10 with 
B = X, and recall Corollary 4.6. (iv): Clear from (iii). ■ 

A second look at the proof of (i)=> (ii) in Theorem 4.10 reveals that this implication does actually 
not require the assumption that B be an affine subspace of X containing A. The following example 
illustrates that the converse implication fails even when B is a superset of aff (A). 

Example 4.12 Suppose that X = R 2 , and set A := R x {0}, a = (0,0), and B = Rx {0,2}. Then 
A = aff (A) C B and ball(a;l) n B C A; however, (Vx G A) Nj(x) = {0} x R + and therefore 
N» = {0}xR+^{(0,0)}. 

Two convex sets 

It is instructive to interpret the previous results for two convex sets: 

Theorem 4.13 (two convex sets: restricted normal cones and relative interiors) Let A and B be 

nonempty convex subsets ofX. Then the following are equivalent: 

(i) ri A n ri B ^ 0. 

(ii) G ri(B - A). 

(iii) cone(B - A) = span(B - A). 

(iv) N A (c) n (-Nb(c)) ncone(B — A) = {0} for some c G AnB. 

(v) N A (c) n (-Nb(c)) n cone(B - A) = {0} for every c G AnB. 

(vi) N A (c) n (-N B (c)) nspan(B — A) = {0} for some c G A n B. 

(vii) N A (c) n (-Nb(c)) n span(B - A) = {0} for every c G AnB. 

(viii) Nf (AUB) (c) n (-Nf {AUB \c)) = {0} /or some c G AnB. 

(ix) Nf (AUB) (c) n (-Nf (AUB) (c)) = {0} for every c G AnB. 

(x) <_f- A) (0) = {0}. 

Proo/. By [23, Corollary 6.6.2], (ii) <^riAnriB^0oOGriA-riB«=> (ii). 

Applying Proposition 2.8 to B — A, and [3, Proposition 3.1.3] to cone (B — A), we obtain 
(42a) (ii) O (iii) O cone(B - A) = span(B - A) 
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(42b) Ocone(B-A)n (cone(B- A)) w = {0}. 

Let c G A n B. Then Corollary 4.8 (with L = X) yields N A {c) n ( - N B (c)) = N A - B (0) = (A - 
B) e = (B- A)® = (cone(B- A))®. Hence 

(43) (VcGC) N A (c) n (-Nb(c)) ncone(B- A) = (cone (B - A))® n cone (B - A) 
and 

(44) (VceC) N A (c) n (-Nb(c)) nspan(B- A) = (cone (B- A))® nspan(B- A). 
Combining (42), (43), and (44), we see that (ii)-(vii) are equivalent. 

Next, Lemma 4.1 and Corollary 4.8 yield the equivalence of (viii)-(x). 

Finally, (x)o(ii) by Corollary 4.11(i). ■ 

Corollary 4.14 (two convex sets: normal cones and interiors) Let A and B be nonempty convex sub- 
sets ofX. Then the following are equivalent: 

(i) G int(B — A). 

(ii) cone(B — A) = X. 

(iii) N A (c) n (-N B (c)) = {0} for some c G A n B. 

(iv) N A (c) n (-Nb(c)) = {0}/or ram/ c G A n B. 

(v) N A _ B (0) = {0}. 

Proof. We start by notating that if C is a convex subset of X, then G intC <S> G ri C and 
span C = X. Consequently, 

(45) (i) O G ri(B - A) and span(B - A) = X. 

Assume that (i) holds. Then (45) and Theorem 4.13 imply that cone(B — A) = cone (B — A) = 
span(B — A) = X. Hence (ii) holds, and from Theorem 4.13 we obtain that (ii)=>(iii)<=Kiv)<=Kv). 
Finally, Corollary 4.11(iii) yields the implication (v)=>(i). ■ 

5 Further examples 

In this section, we provide further examples that illustrate particularities of restricted normal 
cones. 

As announced in Remark 3.8, when a G A2 ^ A\, it is possible that the nonconvex restricted 
normal cones satisfy (a) 2 (a) even when A\ and A 2 are both convex. This lack of inclusion 
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is also known for the Mordukhovich normal cone (see [22, page 5], where however one of the sets 
is not convex). Furthermore, the following example also shows that the restricted normal cone 
cannot be derived from the Mordukhovich normal cone by the simple relativization procedure of 
intersecting with naturally associated cones and subspaces. 

Example 5.1 (lack of convexity, inclusion, and relativization) Suppose that X = R 2 , and define 
two nonempty closed convex sets by A := A\ := epi(| • |) and A2 := epi(2| • |). Then a := (0,0) G 
A2 ^ A\. Furthermore, set B := R x {0}. Then 

fR+(l,-l), ifx 2 = X!>0; 
(46a) (Vx = (x!,x 2 ) G At) N^(x) = I R+(-l,-l), if x 2 = -x x > 0; 

[{(0,0)}, otherwise, 

'R+(2,-l), ifx 2 = 2x!>0; 
R+(-2, -1), if x 2 = -2xx > 0; 
{ ( 0, ) } , otherwise . 



(46b) (Vx=(x 1 ,x 2 )GA 2 ) Nl 2 (x) = < 



Consequently, 

(47a) N^(fl) = cone {(1,-1), (-1,-1)}, 

(47b) < 2 (fl) = cone {(2,-1), (-2,-1)}. 

Note that (a) % N^(fl) and N A 2 ( a ) £ n a, ( a )' in fact ' N l t ( a ) n N A 2 ( a ) = {(°/°)}- Fur- 
thermore, neither (a) nor (a) is convex even though A\, A 2 , and B are. Finally, observe 
that cone(B - a) = span(B - a) = B, that cone(B - A) = R x R_, that span(B — A) = X, 
and that N A (a) = cone[(l, -1), (-1, -1)] ^ N%(a). Consequently, cone(B - a) n N A (a) = 
span(B - a) n N A (a) = {(0,0)}, cone(B - A) n N A (a) = N A (a) = span(B - A) n N A (a). There- 
fore, N^(fl) cannot be obtained by intersecting the Mordukhovich normal cone with one of the sets 
cone(B — a), span(B — a), cone(£> — A), and span(B — A). 

We shall present some further examples. The proof of the following result is straight-forward 
and hence omitted. 

Proposition 5.2 Let Kbea closed cone in X, and let Bbe a nonempty cone ofX. Then 



(48) Nf (0) = |J N|(x) = [J N|(x) = |J N*(x) = (J N|(x). 

XGK xebdryK xGK xebdryK 

Example 5.3 Let K be a closed convex cone in X, suppose that Uq G int(X) and that K C {uo} e > 
and set £> := {wo}^- Then: 

(i) (Vx G K n £>) N|(x) = {0}. 

(ii) (Vx G X \ B) N|(x) = N|(x) = N K (x) = K e n {x}^. 
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(iii) n|(0) ={j xeK Ni(x) = U G ^ B (^ e n{xK) = ^nU iei (xBW 1 - 

If one of these unions is closed, then all closures may be omitted. 

Proof, (i): Letx G KDB. It suffices to show that BDP^ l (x) = {x}. To this end, take y G BnP^(x). 
By definition of B, we have {uq, x) = and (uo,y) = 0. Hence 

(49) (u ,y-x>=0. 

Furthermore, x = P^y and hence, using e.g. [4, Proposition 6.27], we have y — x G K G . Since 
«o £ int K, there exists J > such that ball(«o; S) C K. Thus y — x G (ball(«o; <5)) e . In view of (49), 
£||y — x|| < 0. Therefore, y = x. 

(ii) : Let x G K \ B. Using Lemma 3.4(iii)&(iv), Corollary 4.6, Lemma 3.4(vii), and [4, Exam- 
ple 6.39], we have 

(50) N|(x) C Nf (x) C Nf (x) = N K (x) = N£ onv (x) = X e n {x}^. 

Since x G K C {« } ffi and x £ B, we have (« ,x) > 0. Now take u G (K e n {x}^) \ {0}. Since 
u € K e and «o G int(i<C), we have (u, Uq) < 0. Now set 

(51) b:=x-p4u. 

{u ,u) 

Then beB and b-x=- (u Q ,x) {u Q ,uy l u G R++« C K e n {x} 1 - = N£ onv (x). By [4, Proposi- 
tion 6.46], x = P K b. Hence & - x G Nf (x) and thus u G Nf (x). Therefore, K e n {x}^ C Nf (x). In 
view of (50), and since Nf (x) C Nf (x) C Nr(x) by Lemma 3.4(iii)&(iv), we have established (ii). 

(iii) : Combine (i), (ii), and Proposition 5.2. ■ 

Example 5.4 (ice cream cone) Suppose that X = R ra = R m_1 x R, where m G {2,3,4, . . .}, and let 
B > 0. Define the corresponding closed convex Zee cream cone by 

(52) K := jx G R m | B^x\ + • • • + x 2 m ~ < x m | , 
and set B := R m_1 x {0}. Then the following hold: 

(i) Nf(0,0) = {(0,0)}. 

(ii) N K (o 7 o) = {y g R m I rV^ + --- + yLi < -y-} = IW-^ r+(^z,-i). 

V " ' ||z||<l 

(iii) (Vz G R- 1 x {0}) Nl(z,B\\z\\) = Nf ( Z/ B||z||) = N K ( Z/ B\\z\\) = R + (Bz, -||z|j). 

(iv) Nf (0,0) = UzeR<»-i R+(jS z / _ 1)/ which is a closed cone that is not convex. 
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Proof. Clearly, K is closed and convex. Note that K is the lower level set of height of the continu- 
ous convex function 

(53) /: R m = K" 1 - 1 x R -> R: x = (z,x m ) h-> jS||z|| - x m ; 
hence , by [26, Exercise 2.5(b) and its solution on page 205], 

(54) int(X) = {x = (z,x m ) £ R" 7-1 x R | 0||z|| < x m ). 
Lemma 3.4(iii)&(iv), Corollary 4.6, and Corollary 4.11(iii) imply that 

(55) (VxGint(K)) nI(x)QN£(x)QN£(x) = N k (x) = {0}. 

Write x = (z,x m ) £ R m_1 x R = X, and assume that x £ K. We thus assume that x £ bdry(_K), i.e., 
jS||z|| = x m by (54), i.e., x = (z,j8||z||). Combining [4, Proposition 16.8] with [26, Corollary 2.9.5] 
(or [4, Lemma 26.17]) applied to /, we obtain 

(56) N K {z,p\\z\\) = cane (/3a|| • ||(z) x {-1}), 

where 9|| • || denotes the subdifferential operator from convex analysis applied to the Euclidean 
norm in R m_1 . In view of [4, Example 16.25] we thus have 

(S7\ /cone^Hzll^zxI-l}), ifz^O; 

[cone (ball(0;jS) x {-1}), ifz = 0. 

The case z = in (57) readily leads to (ii). 

Now set « := (0,1) £ R m_1 x R. Then {uq}- 1 = B and {« }® = !R m_1 x R + D _K. Note that 
(0, 0) G X n B and thus N| (0, 0) = { (0, 0) } by Example 5.3(i). We have thus established (i). 

Now assume that z ^ 0. Then N K (z,B\\z\\) = R+(Bz, -\\z\\). Note that Bz ^ and so 
(z, jS||z|| ) i. B. The formulas announced in (hi) therefore follow from Example 5.3(h). 

Next, combining (54), (55), and Example 5.3(iii) as well as utilizing the compactness of the unit 
sphere in R m_1 , we see that 



(58) N| (0,0)= |J R + Q3z,-|MI)= (J ~1)= U R+QSz,-l). 

Nl=l l|z||=l 

This establishes (iv). ■ 

Remark 5.5 Consider Example 5.4. Note that Nf (0, 0) is actually the boundary of Nk{0, 0). Fur- 
thermore, since Nk(0, 0) = N£ onv (0, 0) by Lemma 3.4(vii), the formulas in (ii) also describe K G , 
which is therefore an ice cream cone as well. 
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6 Cones containing restricted normal cones 

In this section, we provide various examples illustrating that the restricted (proximal) normal cone 
does not naturally arise by considering various natural cones containing it. 

Let A and B be nonempty subsets of X, and let a E A. We saw in Lemma 3.4(ii) that 

(59) Nl(a) = cone ((£> - a) n (P A x a - a)) C cone(B - a) n NP rox (a). 

This raises the question whether or not the inclusion in (59) is strict. It turns out and as we shall 
now illustrate, both conceivable alternatives (equality and strict inclusion) do occur. Therefore, 
N A (a) is a new construction. 

We start with a condition sufficient for equality in (59), 

Proposition 6.1 Let A and B be nonempty subsets ofX. Let A be closed and a € A. Assume that one of 
the following holds: 

(i) P A l {a) — a is a cone. 

(ii) A is convex. 

Then N\(a) = cone(B - a) n NP rox (fl). 

Proof, (i): Lemma 2.6(ii). (ii): Combine (i) with Lemma 2.7. ■ 

The next examples illustrates that equality in (59) can occur even though P^ l {a) — a is not a 
cone. Consequently, the assumption that P A l {a) — a be a cone in Proposition 6.1 is sufficient — but 
not necessary — for equality in (59). 

Example 6.2 Suppose that X = K 2 , and let A := X \ K 2 ++/ B := R+(l, 1), and a := (0, 1). Then 
one verifies that 



(60a) P A \a) -a = [0,1] x {0}, 

(60b) N p A im (a) = cone(P A 1 fl - a) = K+ x {0}, 

(60c) cone(B -a) = {(h,t 2 ) G R 2 | h > 0,f 2 < fi} U {(0,0)}, 

(60d) N|(a) =K+ x {0}. 



Hence N|(«) = R+ x {0} = cone(B - a) n NP rox (fl). 

We now provide an example where the inclusion in (59) is strict. 
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Example 6.3 Suppose that X = R 2 , let A := cane{(l,0), (0,1)} = bdryR^, B := R+(2,l), and 
a := (0, 1) G A. Then one verifies that 



(61a) 
(61b) 
(61c) 
(61d) 



P A 1 (a)-a = 1-00,1] x {0}, 

NP rox (fl) = cone(P A 1 fl — a) = R x {0}, 
cone(B-fl) = {(x 1 ,x 2 ) G R 2 | x l >0,2x 2 < x x ) U {(0,0)}, 
N» = {(0,0)}. 



Hence N| (a) = { (0, 0) } g R + x {0} = cone(B - a) n N^ * (fl), and therefore the inclusion in (59) 
is strict. In accordance with Proposition 6.1, neither is P^" 1 («) — a a cone nor is A convex. 



The inclusions in (62) are optimal in the sense that all possible combinations (strict inclusion and 
equality) can occur: 

• For results and examples illustrating equality in (62b) and equality in (62c), see Proposi- 
tion 6.5 and Example 6.6 below. 

• For an example illustrating equality in (62b) and strict inequality in (62c), see Example 6.7 
below. 

• For an example illustrating strict inequality in (62b) and equality in (62c), see Example 6.10 
below. 

• For examples illustrating strict inequality in (62b) and strict inequality in (62c), see Exam- 
ple 6.8 and Example 6.9 below. 

The remainder of this section is devoted to providing these examples. 

Proposition 6.4 Let A and B be nonempty subsets ofX. Let A be closed a € A. Assume that one of the 
following holds: 

(i) P A l (x) — xis a cone for every x G A sufficiently close to a. 

(ii) A is convex. 



Let us now turn to the restricted normal cone N A (a) . Taking the outer limit in (59) and recalling 
(14), we obtain 

(62a) N» =V^N B A (x) 
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Then (62b) holds with equality, i.e., Nj[(a) = Lkru^ (cone(£> — x) n N^ rox (x)) 

Proof. Indeed, if x G A is sufficiently close to a, then Proposition 6.1 implies that Njj (x) = cone(B — 
x) fl N^ rox (x) . Now take the outer limit as x —> a in A. ■ 

Proposition 6.5 Lef Abe a nonempty closed convex subset ofX, let Bbea nonempty subset ofX, and let 
a € A. Assume that x i-> cone(£> — x) is outer semicontinuous at a relative to A, i.e., 

(63) limcone(£> — x) = cone(£> — a), 



x— >fl 

xeA 



Then (62) holds with equalities, i.e., 

(64) Nj(a) =Ij^(cone(B-x)nNP rox (x)) = (Hmcone(B - x)) n N A (a). 



x-y« \ y ' n - ' i \ x^a 

IEA x^A 



Proof. The convexity of A and Lemma 3.4(vii) yield 

(65) cone(B - a) n N A (a) = cone(B - a) n NP rox (fl). 
On the other hand, Proposition 6.1(ii) and Lemma 3.4(iv) imply 

(66) cone(B - a) n NP rox (fl) = N|(a) C N|(a). 
Altogether, cone(B - fl) n N A (a) C N\(a). In view of (63), 

(67) (H^cone(B-x)) nN A (fl) C N^(fl). 

Recalling (62), we therefore obtain (64). ■ 

Example 6.6 Let A be a linear subspace of X, set B := A, and a := (0,0). Then N^(a) = {0} by 
(26d), N A (fl) = A 1 , and cone(B — x) = A, for every x G A. Hence (lim*^« cone(B — x) ) n N A (a) = 
{0} and (62) holds with equalities. 

In Proposition 6.5, the convexity and the outer semicontinuity assumptions are both essential in 
the sense that absence of either assumption may make the inclusion (62c) strict; we shall illustrate 
this in the next three examples. 

Example 6.7 Suppose that X = R 2 , and let A := epi(| • |), B := R x {0}, and a := (0,0). If x = 

(xi,x 2 ) G A \ {fl}, then x 2 >0, B - x = R x {-x 2 }, and so cone(B — x) = R x R U {(0,0)}. 

Hence 

(68) Iimcone(B — x) = R x R_ / Rx {0} = cone(B - a), 



x—>a 
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i.e., (63) fails. Since A is closed and convex, Lemma 3.4(vii) implies that N A (a) = N A onv (a) = —A. 
Thus 

(69) (hmcone(B - x)) D N A (fl) = —A. 

xeA 

Proposition 6.4(h) yields equality in (62b), i.e., 

(70) N B A (a) =h^(cone(B-x)nNP rox (x)). 

Already in Example 5.1 did we observe that 

(71) <(fl) = cone{(l, -1), (-1, -1)}. 
Therefore we have 

(72) N B A {a) = hrK (cone(B - x) n NP rox (x)) C (T|^cone(B - x)) D N A (a), 



xieA xieA 



i.e., the inclusion (62c) is strict. 

Example 6.8 Suppose that X = R 2 , and let A := cone{(l,0), (0,1)} = bdryR 2 ., B:=Rx {1} U 
{(1,0), ( — 1,0)}, and a := (0,0). Clearly, A is not convex. If x = (xi,x 2 ) G A is sufficiently close 
to a, we have 



(73) cone(B - x) = 



R x R + , if xi > 0; 

R x R ++ Ucone{(l / -x 2 ) / (-l,-x 2 )} / if x 2 > 0. 



This yields 

(74) mncone(B-x) =RxR+ = cone(B-a), 



xeA 



i.e., (63) holds. Next, if x = {x\, x 2 ) G A, then 

{{xi } x ] — oo, xi] , if Xi > and x 2 = 0; 
]-oo,x 2 ] x {x 2 }, if xi = 0andx 2 > 0; 
R 2 , ifxi = x 2 = 0, 

and so 

' {0} x R, if xi > and x 2 = 0; 

(76) N p A Tm {x) = cone {P A \x) - x) = J R x {0}, if x x = and x 2 > 0; 

R 2 , ifxi = x 2 = 0. 

It follows that 

(77) N A (a) = h^Nl mx (x) = R 2 U ({0} x R) U (R x {0}). 



x— >a 
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If x G A is sufficiently close a, then 

(78) N*(x) = 



{(0,0)}, ifx^a; 
R_ x {0}, i£x = a. 



It follows that 

(79) Nl(a) = R_ x {0}. 

Combining (73) and (76), we obtain for every x = (x\, x 2 ) £ A sufficiently close to a that 



(80) 

Thus 
(81) 



cone(B-x)nNP rox (x) = < 



' {0} x R + , if xi > and x 2 = 0; 
{(0,0)}, if xi = 0andx 2 > 0; 
R x {0}, ifxi = x 2 = 0. 



lim (cone(B - x) n NP rox (x)) = ({0} x R+) U (R_ x {0}). 



x— >a 



Using (79), (81), (74), and (77), we conclude that 

(82a) Ng(a) = R- x {0} 

(82b) 

(82c) 



C ({0} x R+) U (R_ x {0}) = lim (cone(B - x) n NP rox (x)) 
C ({0} x R+) U (R x {0}) = (li^cone(B - x)) n N A (a). 



Therefore, both inclusions in (62) are strict; however, A is not convex while (63) does hold. 

Example 6.9 Suppose that X = R 2 , let A := cone{(l,0), (0,1)} = bdryR^, B := R+(2,l) and 
a := (0,0). Let x = (xi,x 2 ) G A. Then (see Example 6.8) 



(83) 



P A \x)-x = { 



{0} x ] — oo, xi] , if xi > and x 2 = 0; 

oo, x 2 ] x {0}, if Xi = and x 2 > 0; 
R 2 ., ifxi = x 2 = 0, 



(84) 



< ox (x) = 



' {0} x R, if x 1 > and x 2 = 0; 
R x {0}, if xj = and x 2 > 0; 
R 2 , if xi = x 2 = 0, 



and 
(85) 



N A (a) = limNP rox (x) = R 2 U ({0} x R) U (R x {0}). 



x— >a 
xeA 
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Thus 



or/ s //„ i/ n n nn f{0}xR + , if > and x 2 = 0; 

[{(0,0)}, if xx = and x 2 > 0. 

Hence 

(87) N>)=EENj(x) = {0}xR + . 

On the other hand, 



xeA 



(88) cone(B - x) = < 
Combining (84) and (88), we deduce that 

(89) cone(B - x) n NP rox (x) = < 
Using (88) and (89), we compute 



'{(yi/i/2) | t/2 >0,yi < 2y 2 } U{(0,0)}, ifx x > and x 2 = 0; 
{(yi'3/2) | yi > 0, 2y 2 < j/j} U {(0,0)}, if x x = and x 2 > 0; 
k B, if Xi = x 2 = 0. 



' {0} x R + , if x x > and x 2 = 0; 
R+ x {0}, if xi = and x 2 > 0; 
[{(0,0)}, ifxi = x 2 = 0. 



(90) limcone(B - x) = {(1/1,1/2) | yi > or i/ 2 > 0} = X \ R__ ^ B = cone(B - a) 



and 

(91) Ii^(cone(B-x) nNP rox (x)) = ({0} x R+) U (R+ x {0}) = cone{(0,l), (1,0)}. 



Using (87), (91), (90), and (85), we conclude that 
(92a) Nj(a) = {0}xR+ 

(92b) C ({0} x R + ) U (R+ x {0}) = Ih^ ( cone(B - x) n < ox (x)) 

(92c) C ({0} x R) U (R x {0}) = (IirKcone(B - x)) D N A (fl). 



xtEA 



Therefore, both inclusions in (62) are strict; however, A is not convex and (63) does not hold (see 
(90)). 

Finally, we provide an example where the inclusion (62b) is strict while the inclusion (62c) is an 
equality. 
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Example 6.10 Suppose that X = R 2 , let A := cone{(l,0), (0,1)}, B := { (1/1,3/2) | J/i +1/2 = l}, 
and fl := (0,0). Let x = (xi,X2) £ A be sufficiently close to a. We compute 

(93a) cone(B - x) 

(93b) NP rox (x) 

(93c) N B A (x) 

Furthermore, Example 6.8 (see (77)) implies that N A (a) = R 2 U ({0} x R) U (R x {0}) . We thus 
deduce that 

(94a) N» = {(0,0)} 

(94b) C ({0} x R + ) U (R+ x {0}) = Hm (cone(B - x) n NP rox (x)) 

xeA 

(94c) = ({0} x R+) U (R+ x {0}) = (H^cone(B - x)) n N A (a). 

Therefore, the inclusion (62b) is strict while the inclusion (62c) is an equality. 

7 Constraint qualification conditions and numbers 

Utilizing restricted normal cones, we introduce in this section the notions of CQ-number, joint-CQ- 
number, CQ condition, and joint-CQ condition, where CQ stands for "constraint qualification". 



= {{yi,V2) |yi+y 2 >o}u{(o,o)}, 

{{0} x R, if x x > and x 2 = 0; 
R x {0}, if xi = and x 2 > 0; 
R 2 , ifxi = x 2 = 0, 

= {(0,0)}. 



CQ and joint-CQ numbers 

Definition 7.1 (CQ-number) Let A, A, B, B, be nonempty subsets ofX, let c € X, and let 5 £ R++. 
The CQ-number at c associated with (A, A, B, B) and 5 is 



(95) e s :=e s {A, A, B,B):= sup <^ (u, 



u G Nl{a),v G -N$(b), \\u\\ < 1, \\v\\ < 1, 
\\a — ell < 5, \\b — ell < 5. 



The limiting CQ-number at c associated with (A, A, B, B) is 

(96) 9 := 9(A, A, B,B) := lim S (A, A, B, B) . 



Clearly, 

(97) 9 S (A, A, B, B) = 9 S (B, B, A, A) and 9(A, A, B, B) = 0(B, B, A, A) . 
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Note that, 5 h-> 9s is increasing; this makes 9 well defined. Furthermore, since belongs to 
nonempty B-restricted proximal normal cones and because of the Cauchy-Schwarz inequality, 
we have 

(98) c G A n B and < Si < S 2 =3- < < Q h < % < 1, 

while 9s, and hence 9, is equal to— ooifc £ A n B and £ is sufficiently small (using the fact that 
sup = — oo). Using Proposition 3.7(ii)&(vi), we see that 

(99) A C A' and KB' =» ^(A, A, B, B) < S (A, A', B, B 1 ) 
and, for every x G X, 

(100) ^( A 'A'' B 'B') ate = ^( A-X 'A'- X ' B - X '!!- X ) at c - x. 

To deal with unions, it is convenient to extend this notion as follows. 

Definition 7.2 (joint-CQ-number) Let A := (A;); e ;, A := (A ; -), e j, B := (B ; -)y e/ , 6 := (B ; -)y €/ fee 

nontrivial collections 1 of nonempty subsets ofX, let c € X, and let 5 G TTze joint-CQ-number at c 

associated with (A,A,B,B) and 3 is 

(101) 9 S = 9 S (A, A, B,B) := sup ^ ( A { , A ir B jr By) , 

(f,;)€lx/ 

and f/ie limiting joint-CQ-number at c associated with {A, A, B,B) is 

(102) = 6(A, A, H,jB) := lim ^ (A, A, 5, H) . 

For convenience, we will simply write 9s, 9 and omit the possible arguments (A, A, B, B) and 
(A,A,B,B) when there is no cause for confusion. If I and / are singletons, then the notions of 
CQ-number and joint-CQ-number coincide. Also observe that 

(103) c G |J A { n |J By => (W G R++) < 6 < 6 S < 1 

while = = — oo when £ > is sufficiently small and c does not belong to both \J iGl Aj 
and Uy € j By. Furthermore, the joint-CQ-number (and hence the limiting joint-CQ-number as well) 
really depends only on those sets A ; - and By for which c G A; fl By. 

To illustrate this notion, let us compute the CQ-number of two lines. The formula provided 
is the cosine of the angle between the two lines — as we shall see in Theorem 8.12 below, this 
happens actually for all linear subspaces although then the angle must be defined appropriately 
and the proof is more involved. 

x The collection (A,), e / is said to be nontrivial if 7 ^ 0. 
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Proposition 7.3 (CQ-number of two distinct lines through the origin) Suppose that w a and w\, are 

two vectors in X such that \\w a \\ = = 1. Let A := lRw a , B := Ra^, and 3 G R++. Assume that 

A n B = {0}. Then the CQ-number at is 

(104) e 5 {A,A,B,B) = \{w a ,w b )\. 
Proof. Sets := (w a ,w h ). 

Assume first that s ^ 0. Let a = ocw a G Aandb = Bw h G B. ThenP^ 1 (a) — a = Na(a) = {w a } ± ', 
considering (B — a) n {w a } L leads to Bs = a. Hence (P^ 1 (fl) — a) D(B — a) = Bwy — otw a and 

(105) ~N\{a) = cone (as^Wb — aw a ). 
Similarly, 

(106) -N$(b) = cone (Bw b - /3s _1 07 fl ). 

Now set u := as -1 ^ — cav a G N^(fl) and v := — Bs~ x w a G —N§(b). One computes 

(107) \\u\\ = 1 1 ; , , \\v\\ = iri , , , and (u,v) = — -. 

\s\ \s\ s 

Hence 

(108) „ (" ,V ) „ = sgn(a) sgn(S)s. 

Il u ll ' \\ v \\ 

Choosing a. and B in { — 1, 1 } appropriately, we arrange for (u,v) / ( || m || • 1 1 u 1 1 ) = | s | , as claimed. 
Now assume that s = 0. Arguing similarly, we see that 

(109) fiJW-jf' ""^ and (V* eB ) fJ^) = {<° } < '"^ 

I B, if a = 0, I A, if o = 0. 

This leads to 6 s (A, A, B, B) = = |s|, again as claimed. ■ 

Let A := (Ai)i e i, A := (A^-gj, B := (B/)/6/ an d £> := (By)y e j be nontrivial collections of 
nonempty closed subsets of X and let ^ G R++. Set A := A[, A := \J iG[ A y B := \Jj e jBj, 
B := (J/ej Bj, and suppose that c G A n B. It is interesting to compare the joint-CQ-number of 

collections, i.e., 0$ [A, A, B, B) , to the CQ-number of the unions, i.e., 6 s (A, A, B, £>) . We shall see in 
the following two examples that neither of them is smaller than the other; in fact, one of them can be 
equal to 1 while the other is strictly less than 1. 

Example 7.4 (joint-CQ-number < CQ-number of the unions) Suppose that X = R 3 , let I := 

/ := {1,2}, Ai := R(0,1,0), A 2 := R(2,0,-l), B x := R(0,1,1), B 2 := R(1,0,0), c := (0,0,0), 
and let 3 > 0. Furthermore, set .4 := (A ; -) ; - eZ/ £ := (B ; ) ;e/ , A := A x U A 2 , and B := B x U B 2 . Then 

(110) 6 S (A, A, 6, B) = j- < 1 = 6 S (A, A, B, B) . 
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1 

Vw' 



Proof. Using Proposition 7.3, we compute, for the reference point c, 

(Ula) 9 s (A 1 ,A 1 ,B 1 ,Bi) = | <(0, 1,0), ^(0, 1, 1)> | = j-, 

(Hlb) 9 S (A 1 ,A 1 ,B 2 ,B 2 ) = | ((0,1,0), (1,0,0)) | = 0, 

(Ulc) 9 S (A 2/ A 2/ B 1 ,B 1 ) = 1(^(2,0, -1),^(0,1,1)>| = 

(Hid) e 5 {A 2r A 2r B 2 ,B 2 ) = 1(^(2,0,-1), (1,0,0>| = j.. 

Hence 9 S (A, A, B,B) = max {i/j)GlxJ 9 s (A i/ At,Bj,Bj) = < 1. 
To estimate the CQ-number of the union, set 

(112) a := (0, 5, 0) G A x C A and b := (<S, 0, 0)eB 2 C B. 
Note that ||a — c|| = ||fl| = J and \\b — c\\ = \\b\\ = 5. Now define 

(113) a := (S, 0, - J/2) G A 2 C A and b := (0, J, <5) G Bj C B. 

Since ||a — Pb 2 0|| < Pb^II an d Pb 2 « = b, we have £> = Pga. Since ||b — 1 1 < \\b — Pa 2 ^II an d 
P Ai fr = a, we have a = P A b. Therefore, G B n P A 1 (a) and a G A n P B 1 (&). It follows that 



(114a) u := j(b — a) = (0, 0, 1) G N^(fl), 

(114b) := |(b -S) = (0,0,1) G -N$(b 

Since ||w|| = ||u|| = 1, we obtain 1 = (u,v) < 9s(A,A,B,B) < 1. 



Example 7.5 (CQ-number of the unions < joint-CQ-number) Suppose that X = R, let I := / := 

{1,2}, Ai := B x := R , A 2 := B 2 := R+, c := 0, and 5 > 0. Furthermore, set .4 := (A ; -); 6 i/ 
£ := (Bj) j€ i, A := Ai U A 2 = R, and B := Bi U B 2 = R. Then 

(115) 6> A (A, A, B, B) = < 1 = 0,- (A, A, B, B) . 

Proof. Lemma 3.4(viii) implies that (Vx G R) Njf (x) = {0}. Hence ^(R,R,R,R) = as claimed. 
On the other hand, Nj^(0) = R and Nr + (0) = R+. Hence (J (R_,R_ / R +/ R+) = 1 and 
therefore 9 S (A, A, 6, B) =1 as well. ■ 

The two preceding examples illustrated the independence of the two types of CQ-numbers (for 
the collection and for the union). In some cases, such as Example 7.4, it is beneficial to work with 
a suitable partition to obtain a CQ-number that is less than one, which in turn is very desirable in 
applications (see Section 10). 



CQ and joint-CQ conditions 

Definition 7.6 (CQ and joint-CQ conditions) Let c G X. 
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(i) Let A, A, B and B be nonempty subsets ofX. Then the (A, A, B, B)-CQ condition holds at c if 

(116) Nf(c)n(-A#(c)) C{0}. 

(ii) Let A := (A,-)j € j, A := (A,-),- e j, £> := (Bj)j € j and B := (By)y e j be nontrivial collections of 
nonempty subsets of X. Then the (A, A,B,B)-)omt-CQ condition holds at c if for every (i,j) G 
I x J, the (Ai, A[, Bj, Bj)-CQ condition holds at c, i.e., 

(117) (V(ii)eix/) Nj(c)n(-N^( C ))C{0}. 



In view of the definitions, the key case to consider is when c G A n B (or when c G A,- fl By in 
the joint-CQ case). The CQ-number is based on the behavior of the restricted proximal normal 
cone in a neighborhood of the point under consideration — a related notion is that of the exact 
CQ-number, where we consider the restricted normal cone at the point instead of nearby restricted 
proximal normal cones. 

Definition 7.7 (exact CQ-number and exact joint-CQ-number) Let c £ X. 

(i) Let A, A, B and B be nonempty subsets of X. The exact CQ-number at c associated with 
(A,A,B,B)is 2 

(118) a := a(A, A, B, B) := sup u G N%(c),v G -N§(c),\\u\\ < l,\\v\\ < lj. 

(ii) Let A := (A,),- G 7, A := (A,) !G j, B := (B ; )y G j and B := (B,)y e j be nontrivial collections of 
nonempty subsets ofX. The exact joint-CQ-number at c associated with (A, B, A, B) is 

(119) a := a(A, A, B, B) := sup a(A ir A if Bj, By). 

(ij)eixj 



The next result relates the various condition numbers defined above. 

Theorem 7.8 Let A := (A ; ) !G 7, A := (A;)/ 6 j, B := (B ; )y G j and B := (B,-)y € ; fee nontrivial collections 
of nonempty subsets of X. Set A := (J;e/ A B := (Jy G j By, fl«d suppose that c G A n B. Denote 
the exact joint-CQ-number^at c associated with (A, A, B, B) by oc (see (119)), the joint-CQ-number at c 
associated with (A, A, B, B) and 5 > Oby 6 S (see (101)), and the limiting joint-CQ-number at c associated 
with {A, A,B,B) by 6 (see (102)). Then the following hold: 

(i) If a. < 1, then the {A, A, B, B)-CQ condition holds at c. 

(ii) W < 9$. 

2 Note that if c £ A n B, then a = sup = -co. 
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(iii) a < 9. 

Now assume in addition that I and J are finite. Then the following hold: 

(iv) a = 6. 

(v) The (A, A, B, B)-joint-CQ condition holds at c if and only if a = 9 < 1. 

Proof, (i): Suppose that a < 1. The condition for equality in the Cauchy-Schwarz inequality 
implies that for all (i,j) G I x J, the intersection Nj.(c) D (— N B '(c)) is either empty or {0}. In 
view of Definition 7.6, we see that the (A, A, B, £>)-joint-CQ holds at c. 

(ii) : Let G I x /. Take u G AT^(c) and v G -Ng '(c) such that ||w|| < 1 and \\v\\ < 1. Then, 
by definition of the restricted normal cone, there exist sequences (« n )neN m A'/ (^n)neN m By, 
(mjj)jjgn and (z>n)neN m X such that a n — > c, £>„ — > c, m„ — > «, i>„ — > v, and (Vn G N) «„ G N A ; (a n ) 
and y„ G —N^{b n ). Note that since > 0, eventually a„ and b„ lie in ball(c;£); consequently, 

(u n , v n ) < 6$(Ai, A[, By, Bf). Taking the limit as n — » +co, we obtain (w, v) < 6s(Ai, A[, Bj, By) < 9$. 
Now taking the supremum over suitable u and v, followed by taking the supremum over we 
conclude that a < 9$. 

(iii) : This is clear from (ii) and (102). 

(iv) : Let (<5„) nG]N be a sequence in R ++ such that S n -¥ 0. Then for every n G N, there exist 

(120) i n G I, jn e],a n e A in , b n G B jn , u n G nJJ* («„), z>„ G -N^" (&„) 
such that 

(121) \\a n - c\\ < S n , \\b n - c\\ < S n , \\u n \\ < 1, \\v n \\ < 1, and (u n ,v n ) > 9 Sn - 5„. 

Since I and / are finite, and after passing to a subsequence and relabeling if necessary, we can and 

do assume that there exists (i,f) G I x / such that u„ — > w G N A ; . (c) and u„ — > v G — N B ' (c). Hence 

9 <— 0<5 (i — 5 n < (u n , v n ) —> (u, v) < a. Hence 9 < a. On the other hand, a < 9 by (iii). Altogether, 
a = 9. 

(v) : "=>": Let 6 Ix J. If c £ A{ n By, then a (A;, A,, By, By) = -oo. Now assume that 

c G Aj n By. Since the (A,A,B, H)-joint-CQ condition holds, we have nJJ(c) D -nJ(c) = {0}. 
By Cauchy-Schwarz, 

(122) ci(Ai,Ai,Bj,Bj) = sup j(u,y) u G N^(c),u G -N^'(c), \\u\\ <l,\\v\\ < 1 j < 1. 
Since 7 and / are finite and because of (iv), we deduce that = a < 1. 

"<(=": Combine (i) with (iv). ■ 
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8 CQ conditions and CQ numbers: examples 

In this section, we provide further results and examples illustrating CQ conditions and CQ num- 
bers. 

First, let us note that the assumption that the sets of indices be finite in Theorem 7.8(iv) is 
essential: 

Example 8.1 (a < 6) Suppose that X = R 2 , let Y C R + + be such that supT = +co, set (V7 G Y) 
A y := epi(|-y| • | 2 ), B:=R + xK,i := (A 7 ) 7er , A := (X) 7€r , B := (B), B := (X), and 
c := (0,0). Denote the exact joint-CQ-number at c associated with (A,A,B,B) by a. (see (119)), 
the joint-CQ-number at c associated with ( A, A, B, B) and 5 > by 9$ (see (101)), and the limiting 
joint-CQ-number at c associated with {A, A, B, B) by 9 (see (102)). Then 

(123) a = o<i = e s = e. 



Proof. Let 7 G T and pick x > such that a := (x, \"yx 2 ) G A 7 satisfies = \\a — c\\ = 5, i.e., 
x > and 

(124) j 2 x 2 = 2^1 + 7 2 ^ 2 -l) ->■ +co as 7 ^+00 in T. 
Hence 

(125) 7X — > +00, as 7 -4 +00 in T. 
Since A 7 is closed and convex, it follows from Lemma 3.4(vii) that 

(126) u := G R+(7X, -1) = N£ n » = N^(«) = N*(«) = N^(fl). 



Furthermore, v := (1,0) G — (R_ x {0}) = -Nf(c) = -Nf(c) = -N B (c), \\u\\ = \\v\\ = 1, and, 
in view of (125), 

(127a) 1 > 9 S > 9 g ( A 7 , X, B, X) > <«, v) - " fX 



Vj 2 x 2 + 1 

(127b) -4 1 as 7 -4 +00 in T. 

Thus 9 5 = 1, which implies that 9 = 1. Finally, Na 7 (c) = ({0} x R_) _L (R+ x {0}) = -N B (c), 
which shows that a = 0. ■ 

For the eventual application of these results to the method of alternating projections, the condi- 
tion a = 9 < 1 is critical to ensure linear convergence. 

The following example illustrates that the CQ-number can be interpreted as a quantification of 
the CQ condition. 
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Example 8.2 (CQ-number quantifies CQ condition) Let A and B be subsets of X, and suppose 
that c E A n B. Let L be an affine subspace of X containing A U £>. Then the following are equiva- 
lent: 

(i) N£(c) n (-N£(c)) = {0}, i.e., the (A, L, B, L)-CQ condition holds at c (see (116)). 

(ii) N A (c)n(-N B (c))n(L-c) = {0}. 

(iii) < 1, where 9 is the limiting CQ-number at c associated with ( A, L, B, V) (see (96)). 

Proof. The identity (26d) of Theorem 4.5 yields N^(c) = N A (c) n (L - c) and N|(c) = N B (c) n 
(L — c). Hence 

(128) <(c) n ( - N£(c)) = N A (c) n ( - N B (c)) n (L - c), 

and the equivalence of (i) and (ii) is now clear. Finally Theorem 7.8(iv)&(v) yields the equivalence 
of (i) and (iii). ■ 

Depending on the choice of the restricting sets A and B, the {A, A, B,B )-CQ condition may 
either hold or fail: 

Example 8.3 (CQ condition depends on restricting sets) Suppose that X = R 2 , and set A := 

epi(| • |), B := R x {0}, and c := (0,0). Then we readily verify that N A (c) = N%(c) = -A, 
Nl(c) = -bdryA, N B (c) = N$(c) = {0} x R, and N$(c) = {0} x R + . Consequently 

(129) N*(c) n ( - N$(c)) = {0} x R_ while N B A (c) n ( - N#(c)) = {(0,0)}. 
Therefore, the (A, A, B, B)-CQ condition holds, yet the (A, X, B, X)-CQ condition fails. 

For two spheres, it is possible to quantify the convergence of 9s to S = a: 

Proposition 8.4 (CQ-numbers of two spheres) Let z\ and Z2 be in X, let pi and p2 be in R++, set 
Si := sphere(zi;pi) and S2 := sphere (22; pi) and assume that c G Si n S2. Denote the limiting 
CQ-number at c associated with (Si,X, S2, X) by 9 (see Definition 7.1), and the exact CQ-number at c 
associated with (Si, X, S2, X) by a (see Definition 7.7). Then the following hold: 

(1) = a = — — . 

P1P2 

(ii) a < 1 unless the spheres are identical or intersect only at c. 



Now assume that oc < 1, let e e R++, and set 6 := {y/(pi+ pi) 1 +4j0ip2£ — (pi + P2))/2 > 0. Then 

(130) a < S < a + e, 

where 9s is the CQ-number at c associated with (Si, X, S2, X) (see Definition 7.1). 
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Proof, (i): This follows from Theorem 7.8(iv) and Example 3.6. 

(ii): Combine (i) with the characterization of equality in the Cauchy-Schwarz inequality. 

Let us now establish (130). By Theorem 7.8(ii), we have a < 6s- Let Si G Si be such that 
— c\\ < 8, let U\ G N|| (si) be such that ||«i|| = 1, let S2 G S 2 be such that ||s2 — c\\ < 5, and let 
«2 G Nf 2 (s 2 ) be such that |j«2|| = 1. By Example 3.6, 

/iqi\ i si-zi ,s\-zi s 2 -z 2 ,s 2 -z 2 
(131) Mi = if, ry = ± and «2 = ±71 rr = ± " 



|Sl-Zi|| pi ||S2-Z2|| ^2 



Hence 



(132a) pip 2 {u x , u 2 ) < I (si - zi, s 2 - z 2 ) | 

(132b) = I <( Sl - c) + (c - zi), (s 2 -c) + (c- z 2 )> 

(132c) < I (si — c, S2 — c) I + I (si — c, c — Z2) I 

(132d) + I (c - zi, s 2 - c) I + I (c - zi, c - z 2 ) | 

(132e) < ^ 2 + ^ (pi + p 2 ) + pipici 

and thus, using the definition of 5, 

(133) < M i, u 2 ) < « + ^ + %1+P2) = « + e. 

P1P2 

Therefore, by the definition of #,5, we have 6?^ < a + e. 



Two convex sets 

Let us turn to the classical convex setting. We start by noting that well known constraint qualifi- 
cations are conveniently characterized using our CQ conditions. 

Proposition 8.5 Let A and B be nonempty convex subsets of X such that ADB ^ 0, and set L — 
aff (A U B). Then the following are equivalent: 

(i) riA DhB ^ 0. 

(ii) The (A, L, B, L)-CQ condition holds at some point in ADB. 

(iii) The (A, L, B, L)-CQ condition holds at every point in ADB. 

Proof. This is clear from Theorem 4.13. ■ 

Proposition 8.6 Let A and B be nonempty convex subsets ofX such that A n B ^ 0. Then the following 
are equivalent: 
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(i) G int(B - A). 

(ii) The (A, X, B, X)-CQ condition holds at some point in AnB. 

(iii) The (A, X, B, X)-CQ condition holds at every point in AnB. 

Proof. This is clear from Corollary 4.14. ■ 

In stark contrast to Proposition 8.5 and 8.6, if the restricting sets are not both equal to L or to X, 
then the CQ-condition may actually depend on the reference point as we shall illustrate now: 

Example 8.7 (CQ condition depends on the reference point) Suppose that X = R 2 , and let 

/: R -4 R: x i-> (max{0, x}) 2 , which is a continuous convex function. Set A := epi/ and 
B := R x {0}, which are closed convex subsets of X. Consider first the point c := (—1, 0) 6 AnB. 
Then N|(c) = {(0,0)} and N$(c) = {0} x R + ; hence, 

(134) <(c)n(-N$(c)) = {(0,0)}, 

i.e., the (A, A, B, B)-CQ condition holds at c. On the other hand, consider now d := (0, 0) G A n B. 
Then N%(d) = {0} x R and N$(d) = {0} x R + ; thus, 

(135) N B A (d) n ( - N$(d)) = {0} x R_, 
i.e., the {A, A, B, B)-CQ condition fails at d. 



Two linear (or intersecting affine) subspaces 

We specialize further to two linear subspaces of X. A pleasing connection between CQ-number 
and the angle between two linear subspaces will be revealed. But first we provide some auxiliary 
results. 

Proposition 8.8 Let A and B he linear subspaces ofX, and let 5 G R++. Then 

(136) |J N»= |J N»= \jN B A (a)=A ± n(A + B). 

flGAn(B+A ± )nball(0;<5) fl€Anball(0;<5) aeA 

Proof. Let a G A. Then P A x {a) = a + A 1 - and hence P A l {a) - a = A ± . If B n (a + A 1 ) = 0, then 
N B (a) = {0}. Thus we assume that B n (a + A L ) ^ 0, which is equivalent to a G A n (B + A L ). 
Next, by Lemma 3.4(h), N A (a) = A ± n cone(B - a). This implies (VA G R++) cone(B - Aa) = 
cone(A(B — a)) = cone(B — a). Thus, 

(137) (VAGR++) N A (Aa) = A ± n cone{B - Aa) = A ± D cone(B - a) = N|(a). 
This establishes not only the first two equalities in (136) but also the third because 

(138a) (J N|(fl) = |J (A 1 n cone(B - a)) = A L n |J cone(B - a) 

aeA «GA «ga 
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(138b) = A JL n cone ( |J (B - a)) = A^ n cone(B - A) = A L n (B - A) 

flGA 

(138c) = A 1 n (B + A). 

The proof is complete. ■ 

We now introduce two notions of angles between subspaces; for further information, we highly 
recommend [10] and [11]. 

Definition 8.9 Let A and B be linear subspaces ofX. 

(i) (Dixmier angle) [15] The Dixmier angle between A and B is the number in [0, whose cosine is 
given by 

(139) cq{A,B) := sup{| (a,b) \ \ a G A,b G B, < 1, ||b|| < l}. 

(ii) (Friedrichs angle) [16] The Friedrichs angle (or simply the angle) between A and B is the number 
in [0, j] whose cosine is given by 

(140a) c(A, B) := c (A n (A n B)\ B n (A n B)^) 

a e An (AnB) 1 , < 1/ 
b G B n (A n B)- 1 , \\b\\ < 1 



(140b) = sup | \(a,b) 



Let us gather some properties of angles. 
Fact 8.10 Let A and B be linear subspaces ofX. Then the following hold: 

(i) If A n B = {0}, then c(A,B) = c (A,B). 

(ii) If A DB ^ {0}, then c (A, B) = 1. 

(iii) c(A,B) < 1. 

(iv) c(A,B) = c (A,Bn (ARB) 1 ) = c (A n (A n B) ± / B). 

(v) (Solmon) c(A,B) = c(A ± / B ± ). 

Proof, (i)-(iii): Clear from the definitions, (iv): See, e.g., [10, Lemma 2.10(1)] or [11, Lemma 9.5]. 
(v): See, e.g., [10, Theorem 2.16]. ■ 

Proposition 8.11 (CQ-number of two linear subspaces and Dixmier angle) Let A and B be linear 
subspaces ofX, and let 5 > 0. Then 

(141a) 6s{A, A,B,B) = cq[A l n (A + B),B L n (A + B)), 

(141b) e s {A,X,B,B) = c (A ± n(A + B),B ± ), 

(141c) 6 S (A, A, B, X) = c (A\ B ± n(A + B)) t 

where the CQ-numbers at are defined as in (95). 
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Proof. This follows from Proposition 8.8. ■ 

We are now in a position to derive a striking connection between the CQ-number and the 
Friedrichs angle, which underlines a possible interpretation of the CQ-number as a generalized 
Friedrichs angle between two sets. 

Theorem 8.12 (CQ-number of two linear subspaces and Friedrichs angle) Let A and B be linear 
subs-paces ofX, and let 5 > 0. Then 

(142) 9 S (A,A,B,B) = 9 S (A,X,B,B) = 9 S (A / A,B,X) = c(A, B) < 1, 

where the CQ-number at is defined as in (95). 

Proof. On the one hand, using Fact 8.10(v), we have 
(143a) c(A,B)=c(A ± ,B ± ) 

(143b) = Co (A 1 - n (A 1 - n B- 1 )- 1 , B L n (A 1 - n B^) 

(143c) = c (A ± n (A + B),B ± n (A + B)). 

On the other hand, Fact 8.10(iv) yields 

(144a) c (A ± n(A + B),B ± ) =c (yl 1 n(A 1 nB 1 ) 1 ^ 1 ) 

(144b) =c(A L ,B L ) 
(144c) =c (A 1 ,B i n(A 1 n8 1 ) 1 ) 
(144d) =c (A ± ,B ± n(A + B)). 

Altogether, recalling Proposition 8.11, we obtain the result. ■ 

The results in this subsection have a simple generalization to intersecting affine subspaces. In- 
deed, if A and B are intersecting affine subspaces, then the corresponding Friedrichs angle is 

(145) c(A,B) := c(par A, par B). 

Combining (100) with Theorem 8.12, we immediately obtain the following result. 

Corollary 8.13 (CQ-number of two intersecting affine subspaces and Friedrichs angle) Let A and 

B be affine subspaces ofX, suppose that c e A n B, and let 5 > 0. Then 

(146) 9 S (A,A,B,B) = 9 5 {A,X / B,B) = 9 S (A / A,B,X) = c(A,B) < 1, 
where the CQ-number at c is defined as in (95). 
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9 Regularities 

In this section, we study a notion of set regularity that is based on restricted normal cones. 
Definition 9.1 (regularity and superregularity) Let A and B be nonempty subsets ofX, and let c G X. 

(i) We say that B is (A, e, 5) -regular at c G X if e > 0, S > 0, and 



IfB is (X, e, 5)-regular at c, then we also simply speak of (e, 5) -regularity. 

(ii) The set B is called A-superregular at c G X if for every e > there exists 5 > such that B is 
{A, e, 5)-regular at c. Again, ifB is X-superregular at c, then we also say that B is superregular at c. 

Remark 9.2 Several comments on Definition 9.1 are in order. 

(i) Superregularity with A = X was introduced by Lewis, Luke and Malick in [17, Section 4]. 
Among other things, they point out that amenability and prox regularity are sufficient con- 
ditions for superregularity, while Clarke regularity is a necessary condition. 

(ii) The reference point c does not have to belong to B.lic G' B, then for every 5 G ]0, rfg (c) [, B is 
(0, £)-regular at c; consequently, B is superregular at c. 

(iii) If ei > £2 and B is (A, £2, £)-regular at c then B is also (A, t\, ^)-regular at c. 

(iv) If £ G [1, +00 [, then Cauchy-Schwarz implies that B is (£, +oo)-regular at every point in X. 

(v) It follows from Proposition 3.7(ii) that B is ( A\ U A2, £, (S)-regular at c if and only if B is both 
(Ai, £, (5)-regular and (A2, £, <5)-regular at c. 

(vi) If B is convex, then it follows with Lemma 3.4(vii) that B is (A, 0, +co)-regular at c; conse- 
quently, B is superregular. 

(vii) Similarly, if B is locally convex at c, i.e., there exists p G IR++ such that B fl ball(c;p) is 
convex, then B is superregular at c. 

(viii) If B is {A, 0, 5) -regular at c, then B is A-superregular at c; the converse, however, is not true 
in general (see Example 9.3 below). 

As a first example, let us consider the sphere. 

Example 9.3 (sphere) Let z G X and p G Set S := sphere(z; p), suppose that s G S, let 

£ G and let <5 G Then S is (£,/0£) -regular at s; consequently, S is superregular at s (see 

Definition 9.1). However, S is not (0, 5) -regular at s. 



(147) 



(y, b) G B x B, 
||y — c|| < ||& — c|| 
w g N$(b) 




=> (m,i/ — b) < e||w|| • ||y — b 
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Proof. Let b G S and y G S. Then p 2 = ||z-y|| 2 = ||z - b|| 2 + \\y - b|| 2 - 2 (z - b,y - b) = 
P 2 + lly ~~ HI 2 ~~ 2 (z — b,y — b), which implies 

(148) 2{z-b,y-b) = \\y-b\\\ 
On the other hand, by Example 3.6, we have 

(149) N$ (b) nsphere(0;l) = | ± j = j ± ^p}- 
Suppose that m G Nf(b) fl sphere(0; 1). Combining (148) and (149), we obtain 

(150) (N s x (b)nsphere(0;l),y-fc) = { ± l||y - b|| 2 }. 
Thus if ||y — s|| < pe, \\b — s\\ < pe, and u G Nf(b) n sphere(0; 1), then 

(151) («,y - b) < l||y - b|| 2 < l(||y - s|| + ||s - b||) ||y - b\\ < ~ H 

(152) =e||u||-||y-b||, 

which verifies the {e,pe) -regularity of S at s. Finally, by (150), 

(153) max{(Nf(b)nsphere(0;l),y-b)} = -^||y-&|| 2 >0 

and therefore S is not (0, S) -regular at s. ■ 

We now characterizes A-superregularity using restricted normal cones. 

Theorem 9.4 (characterization of A-superregularity) Let A and B be nonempty subsets of X, and let 
c G X. Then B is A-superregular at c if and only if for every e G K.++, there exists 3 G R++ such that 

(y,b) G B x B ^ 

(154) ||y - c|| < 8, \\b — c\\<5 > =^ (u,y -b) < e\\u\\ ■ \\y -b\\. 

ueN£(b) ' J 

Proof. "<£=": Clear from Lemma 3.4(iv). "=>": We argue by contradiction; thus, we assume there 
exists e G K++ and sequences (y n , b n , u n ) n ^ in B x B x X such that (y n , b n ) — > (c, c) and for every 
n G N, 

(155) u n eN§(b„) and (u„,y„ - b„) > e||i/„|| • ||y„ - b n \\. 

By the definition of the restricted normal cone, for every n G N, there exists a sequence 
{b n ,k, «n / fc)fc€N in B x X such that limjt 6]N lim^N = u„, and (V7c G N) M„ /fc G N^(b„ /k ). 

Hence there exists a subsequence (fc n )„ € jsj of (n) n€ ]N such that b,,^ — > c and 

£ 

(156) (Vn G N) (u nikn ,y n - b Hikn ) > -\\u n , kn \\ ■ Ily« - K,k n \\- 

However, this contradicts the A-superregularity of Bate. ■ 
When B = X, then Theorem 9.4 turns into [17, Proposition 4.4]: 
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Corollary 9.5 (Lewis-Luke-Malick) Let B be a nonempty subset ofX and let c E B. Then B is super- 
regular at c if and only if for every e E 1R++ there exists 8 E R++ such that 



We now introduce the notion of joint-regularity, which is tailored for collections of sets and 
which turns into Definition 9.1 when the index set is a singleton. 

Definition 9.6 (joint-regularity) Let A be a nonempty subset of X, let B := (By)ye/ he a nontrivial 
collection of nonempty subsets ofX, and let c EX. 

(i) We say that B is (A, e, 5) -joint-regular at cife>0,5> 0, and for every j E J, Bj is (A, e, 5)-regular 



(ii) The collection B is A-joint-superregular at c if for every j E }, Bj is A-superregular at c. 

As in Definition 9.1, we may omit the prefix A if A = X. 

Here are some verifiable conditions that guarantee joint-(super)regularity. 
Proposition 9.7 Let A := (Ay)y € ; and B := {Bj)j G j be nontrivial collections of nonempty subsets ofX, 



(i) If5>0 and (V; E J) Bj is (Ay, ey, 6j)-regular at c, then B is (A, e, 5) -joint-regular at c. 

(ii) If J is finite and (V; E J) Bj is (Ay, ey, bf)-regular at c, then B is (A, e, 5) -joint-regular at c. 

(iii) If J is finite and (V; E J) Bj is Aj-superregular at c, then B is A-joint-superregular at c. 

Proof, (i): Indeed, by Remark 9.2(v), By is (A, e, <S)-regular at c for every ;' E J. 

(ii) : Since / is finite, we have 5 > and so the conclusion follows from (i). 

(iii) : This follows from (ii) and the definitions. ■ 

Corollary 9.8 (convexity and regularity) Let B := (By)yej be a nontrivial collection of nonempty con- 
vex subsets of X, let A C X, and let c E X. Then B is (0, +oo) -joint-regular, (A, 0, +oo) -joint-regular, 
joint-superregular, and A-joint-superregular at c. 

Proof. By Remark 9.2(vi), By is (0, +co) -regular, superregular, and A-superregular at c, for every 
/ E J. Now apply Proposition 9.7(i)&(iii). ■ 

The following example illustrates the flexibility gained through the notion of joint-regularity. 



(157) 



(y,b) EBxB 
\\y — c\\ < 5, \\b — c\ 
u E N B (b) 




at c. 
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Example 9.9 (two lines: joint-superregularity ^> superregularity of the union) Suppose that d\ 
and d2 are in sphere(0; 1). Set B\ := ~Rd\, B2 := Rd2, and B := B\ U B2, and assume that B\ n B2 = 
{0}. By Corollary 9.8, (Bi, B2) is joint-superregular at 0. Let 5 G R++, and set b := 6d\ and 
y := Sd2- Then ||y — 0|| =S,\\b — 0|| = £,and0< ||y — b|| = 6\\d2 — d\\\. Using Proposition 3.3(iii), 
we see that Ns(fc) = {rfi} -1 - Note that there exists v G {di} 1 - such that {v,d-i) 7^ (for otherwise 
{di} L C {^2}^ => B 2 C Bi, which is absurd). Hence there exists w G {di}^ = {b}^ = N B (b) 
such that = 1 and (m,^) > 0. It follows that (w,y — fe) = (w,y) = <5 (u,^) = ("7^2) || M II ||y — 
&|| / 1|^2 — ^1 1| • Therefore, B is not superregular at 0. 



Let us provide an example of an A-superregular set that is not superregular. To do so, we 
require the following elementary result. 

Lemma 9.10 Consider in R 2 the sets C := [(0,1), (m, 1 + m 2 )] = {(x, 1 + mx) | x G [0,m]} and 
D := [(m, 1), (m, 1 + m 2 )], w/jere m G R++. Lef z G R. TTzen 

((0,1), ifz<m/2; 
(158) PcudM) = I {(0,1), (m,l)}, i/z = m/2; 

[(m,l), ifz>m/2. 



Proof. It is clear that Pd(z,0) = (m, 1). We assume that < z < m for otherwise (158) is clearly 
true. We claim that Pc(z,0 ) = (0,1). Indeed,/: x 1-4 ||(x,l + mx) — (z,0)|| 2 is a convex quadratic 
with minimizer x z := (z — ra)/(l + ra 2 ). The requirement x z > from the definition of C forces 
z > m, which is a contradiction. Hence Pc(z,0) is a subset of the relative boundary of C, i.e., of 
{ (0, 1), (m, 1 + m 2 ) }. Clearly, (0, 1) is the closer to (z, 0) than (m, 1 + m 2 ). This verifies the claim. 
Since Pqjd ( z , 0) is the subset of points in Pq (z, 0) U Pq (z, 0) closest to (z, 0), the result follows. ■ 

Example 9.11 (A-superregularity 7^ superregularity) Suppose that X = R 2 . As in [17, Exam- 
ple 4.6], we consider c := (0,0) G X and B := epi/, where 



(159) 



/: R -4 ]— 00,4-00] : x 1-4 < 



'2 fc (x-2 fc ), if 2 fc < x < 2 fc+1 and Jt G Z; 
0, if x = 0; 

k 4oo, if x < 0. 

Then B is not superregular at c; however, B is A-superregular at c, where A:=Rx{— 1}. 



Proof. It is stated in [17, Example 4.6] that B is not superregular at c (and that B is Clarke regular 
at c). 

To tackle A-superregularity, let us determine Pb(A). Let us consider the point a = (a, — 1), 
where a G [2 _1 , 1 [. Then Lemma 9.10 (see also the picture below) implies that 



r (io), 



(160) 



P B («,-1) = < 



if i < « < I; 



{(I,0),(1,0)}, if«- ^ 
1(1,0), 



2 — ■* ^ 4/ 

3. 
4' 

4 " 



if I < a < 1; 
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and more generally, 



r (2*,0), 



(161) 



2 k < a < 2 k+1 => P B («,-1) = < 



if 2 k < a < 2 k + 2*" 1 ; 



{(2 t ,0),(2W,0)}, if« = 2* + 2*- 1 ; 
k (2 k+1 , 0), if 2 fc + 2 fc - 1 < a < 2 fc+1 . 

Clearly, if a G R_ x {-1}, then P B («) = (0, 0). Let b € B. Then 

'[2 k - 2 + 2 k -\2 k - 1 +2 k ] x {-1}, if fc = (2 fc ,0) and G Z; 
(162) AnPg 1 ^) = I R_ x {-1}, ifb = (0,0); 

0, otherwise. 



Thus 



(163) 



'cone ([~2 k - 2 , 2^] x {-1}), if b = {2 k , 0) and k G Z; 
N B A W = |{(0,0)}u(R_xR__), if ft = (0,0); 

{ (0, 0) }, otherwise. 



Let e G R ++ . Let KeZ be such that 2 K ~ 1 < e, and let <S G ] 0, 2 K ] . Furthermore, let y = (1/1,1/2) G 
B, let b = (bi,& 2 ) G B, let m G N^(fc), and assume that \\y - c\\ < S and that \\b - c\\ < 5. We 
consider three cases. 
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Case 1: b = (0,0). Then u G K 2 and y G R 2 ; consequently, (u,y — b) = (u,y) < < e||u|| • \\y — 
b\\- 

Case 2: b £ ({0} U 2 Z ) x {0}. Then N£(b) = {(0,0}; hence u = and so (u,y - b) = < 

e ll"ll • \\y-H- 

Case 3: b G 2 Z x {0}, say b = (2 fc ,0), where fc G Z. Since 2 k = \\b — 0|| = \\b — c\\ < 5 < 2 K , 
we have k < K. Furthermore, 1/2 > 0, max{|i/i — — fc> 2 1 } < ||y _ ^||/ and w = A(f, —1) = 

(At, - A) where £ G [-2 fc - 2 ,2 fc - 1 ] and A > 0. Hence A < and 

(164a) (u,y -b)= At(yi - bi) - A(y 2 - b 2 ) = A£( yi - bj) - A(y 2 - 0) 

(164b) <M(y 1 -b 1 )<X\t\-\y 1 -b\ 

(164c) < ||u|| •2^ 1 • ||y-b|| < 2 K - 1 || U || • ||y-b|| < e- ||w|| • ||y-b||. 



Therefore, in all three cases, we have shown that (u,y — b) < e\\u\\ ■ ||y — b\\. ■ 
We now use Example 9.11 to construct an example complementary to Example 9.9. 

Example 9.12 (superregularity of the union ^> joint-superregularity) Suppose that X = R 2 , set 

£>i := epi/, where / is as in Example 9.11, £> 2 := X \ B\, and c := (0,0). Since B\ U £> 2 = X is 
convex, it is clear from Remark 9.2(vi) that £>i U £> 2 is superregular at c. On the other hand, since 
£>i is not superregular at c (see Example 9.11), it is obvious that (£>i, B 2 ) is not joint-superregular 
at c. 



10 The method of alternating projections (MAP) 

We now apply the machinery of restricted normal cones and associated results to derive linear 
convergence results. 

On the composition of two projection operators 

The method of alternating projections iterates projection operators. Thus, in the next few results, 
we focus on the outcome of a single iteration of the composition. 

Lemma 10.1 Let A and B be nonempty closed subsets ofX. Then the following hold 3 : 

(i) P A (B \ A) C bdry affAUB A C bdry A. 

(ii) P B (A \ B) C bdry affAUB (B) C bdryB. 

3 We denote by bdry aff AUB (S) the boundary of S C X with respect to aff(A U B). 
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(iii) Ifb G B and a G P A b, then: 



(165) 



a G (bdry A) \ B agA\B => b G B \ A a G bdry A. 



(iv) Jf a G A and b G Psfl, then: 



(166) 



G (bdryB) \ A b G B \ A => a G A \ B b G bdryB. 



Proof, (i): Take b G B \ A and a G P^b- Assume to the contrary that there exists 6 G K++ such 
that aff(A U B) nball(a;<$) C A. Hence a := a + S(b - a)/\\b - a\\ G A and thus d A {b) < d{a,b) < 
d(a, b) = d A (b), which is absurd. 

(ii) : Interchange the roles of A and B in (i). 

(iii) : If a G (bdry A) \ B, then clearly a G A \ B. Now assume that a G A \ B. If b G A, then 
a G P^b = {b} C B, which is absurd. Hence b G B \ A and thus (i) implies that a G Pa(B \ A) C 
bdry A. 

(iv) : Interchange the roles of A and B in (iii). ■ 

Lemma 10.2 Let A and B be nonempty closed subsets ofX, let c G X,lety G B, Ze£ a G P^y, let b G Pgfl, 
and let 5 G K+. Assume that d A (y) < 3 and that d(y,c) < 5. Then the following hold: 

(i) d{a,c) < 25. 

(ii) d{b,y) < 2d(a,y) < 25. 

(iii) d(b,c) < 35. 

Proof. Since y G B, we have 

(167) d(a,b) = d B (a) < d(a,y) = d A (y) < 5. 



which establishes (i). Using (167), we also conclude that d(b,y) < d(b,a) +d{a,y) < 2d(a,y) < 25; 
hence, (ii) holds. Finally, combining (167) and (168), we obtain (iii) via d(b, c) < d(b, a) + d(a, c) < 



Corollary 10.3 Let A and B be nonempty closed subsets ofX, let p G and suppose that c G A n B. 
Then 



Thus, 



(168) 



d(a,c) < d{a,y) + d(y,c) < 5 + 5 — 25, 



5 + 25 = 35. 



(169) 



P A P B PAball(c;p) CbaU(c;6p). 
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Proof. Let b-\ G ball^p), «o £ Pa^-i, bo G Pb«0/ and fli G Pa^o- We have d(ao,b-i) = d A (b-\) < 
d(b-i,c) < p, so c?b(«o) < d(ao,c) < d(ao,b-i) + d(b-\,c) < 2p. Applying Lemma 10.2(iii) to the 
sets B and A, the points do, bo, a\, and J = 2p, we deduce that d(a\, c) < 3(2p) = 6p. ■ 

The next two results are essential to guarantee a local contractive property of the composition. 

Proposition 10.4 (regularity and contractivity) Let A and B be nonempty closed subsets of X, let A 
and B be nonempty subsets ofX, let c G X, let e > 0, and let 8 > 0. Assume that B is (A,e,38)-regular 
at c (see Definition 9.1). Furthermore, assume that y G B n B, that a G Pa{v) n A, that b G Ps{ a )> 
\\y — c|| < 8, and that d A (y) < 8. Then 

(170) \\a-b\\ < (e 3 s + 2e)\\a-y\\, 
where 3 s the CQ-number at c associated with (A, A, B, B) (see (95)). 

Proof. Lemma 10.2(i)&(iii) yields \\a - c\\ < 25 and \\b - c\\ < 38. On the other hand, y - a G n|(«) 
and b — a G —N^(b). Therefore, 

(171) {b — a,y — a) < 6 3 $\\b — a\\ ■ \\y — a\\. 

Since a — b G Ng (£>), \\y — c\\ < 3, and \\b — c\\ < 35, we obtain, using the (A, e, 38) -regularity of B, 
that (a — b,y — b) < e\\a — b\\ ■ \\y — b\\. Moreover, Lemma 10.2(h) states that ||y — b\\ < 2\\a — y\\. 
It follows that 

(172) (a-b,y-b) < 2e\\a - b\\ ■ \\a-y\\. 

Adding (171) and (172) yields \\a - b\\ 2 < {6 3S + 2e)\\a -fc||-||a-y||. The result follows. ■ 
We now provide a result for collections of sets similar to — and relying upon — Proposition 10.4. 

Proposition 10.5 (joint-regularity and contractivity) Let A := (Aj) !G j and B := (By)y G j be nontriv- 
ial collections of closed subsets of X, Assume that A := U/ G i A, an d B := |Jy G jBy are closed, and that 

c G A n B. Let A := (A^-gj and B := (By)y e j be nontrivial collections of nonempty subsets ofX such 
that (Vz G I) P A; ( (bdryB) \ A) C A { and (V; G /) P B; .((bdryA) \ B) C By. Set A := \J ieI A, and 
B := U; G / By lete>0 and let 8 > 0. 

(i) Ifb G (bdryB) \Aanda<E P A (b), then (3i G I) a G P4,(» C A, n A,. 

(ii) If a G (bdry A) \ B and b G Pb(a), fken (3; G /) & G Pb^a) C By n By. 

(iii) If y G B, a G P^(y) & G Pb(#), f^en: 

(173) & G ((bdryB) \ A) D (J(By n By) O & G B \ A a G A \ B. 
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(iv) If x G A,b G P B {x), and a G fU(b)/ then: 

(174) « G ((bdry A) \ B) n (J (A n A,-) O a G A \ B G B \ A. 

iel 

(v) Suppose that B is (A, e, 36) -joint-regular at c (see Definition 9.6), that y G ((bdryB) \ A) D 
\Jj G j(Bj n By), that a G Pa{v)' that b G P B {a), and that \\y — c\\ < 6. Then 

(175) ||fc-fl|| < (e 3S + 2e)\\a-y\\, 

where 9 3 g is the joint-CQ-number at c associated with (A, A, B, B) (see (101)). 

(vi) Suppose that A is (B,e, 36)- joint-regular at c (see Definition 9.6), that x G ((bdry A) \ B) n 
U; 6 i(A; fl A;), that b G P B (x), that a G Pa(P), and that \\x — c\\ < 6. Then 

(176) \\a-b\\ <(9 3S + 2e)\\b-x\\, 

where 633 is the joint-CQ-number at c associated with (A, A, B, B) (see (101)). 

Proof. (i)&(ii): Clear from Lemma 2.4 and the assumptions. 

(iii) : Note that Lemma 10.1(iv)&(iii) and (ii) yield the implications 

(177) b G B \ A o be (bdryB) \ A a G A \ B o a€ (bdry A) \ B => b G \J(Bj D Bj), 

which give the conclusion. 

(iv) : Interchange the roles of A and B in (iii). 

(v) : There exists ;' G / such that y G By n By n ( (bdry B) \ A) . Let b' G P Bj a. Then 

(178) ||fl - b\\ = d B (a) < d B .(a) = \\a - b'\\. 

Since Bis (A, e, 36) -joint-regular at c, it is clear that By is (A, e, 36) -regular at c. Since y G (bdry B) \ 
A and because of (i), there exists i G I such that a G P^y C A[. Since A, C A, it follows that (see 
also Remark 9.2(v)) By is ( A i,£,36) -regular at c. Since y G By n By, a G P^y n A,-, b' G Pb ; a, and 
d^i(y) = d-A(y) = \\y — a \\ < ||y — c|| < <!>, we obtain from Proposition 10.4 that 

(179) < (MAv A,-, By, By) +2e)||fl-y || . 
Combining with (178), we deduce that \\a — b\\ < \\a — b'\\ < (8 3 s + 2e)||fl — y\\. 

(vi) : This follows from (v) and (97). ■ 
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An abstract linear convergence result 

Let us now focus on algorithmic results (which are actually true even in complete metric spaces). 

Definition 10.6 (linear convergence) Let (x n ) ne isj be a sequence in X, let x G X, and let 7 G [0,1[. 
Then (x n )„ e i\j converges linearly to x with rate 7 if there exists w G R + such that 

(180) (Vn G N) d(x n ,x) < 7*7". 

Remark 10.7 (rate of convergence depends only on the tail of the sequence) Let (x n ) n ^f^ be a 

sequence in X, let x G X, and let 7 G ]0,1[. Assume that there exists no £ N and Uq G K+ 
such that 

(181) (Vn G {«0/ M o + 1, • • •}) d(x„,x) < «o7"- 
Set Mi := max {d(x m ,x)/7 m | m G {0, 1, . . . ,uq — 1}}. Then 

(182) (Vn G N) d(x n ,x) < max{n ,«i}7 n / 
and therefore (x n )neN converges linearly to x with rate 7. 

Proposition 10.8 (abstract linear convergence) Let A and B be nonempty closed subsets of X, let 
( a n)neiN be a sequence in A, and let (fc„)„ eN be a sequence in B. Assume that there exist constants 
a. G K+ and 8 G K+ such that 

(183a) 7 := olB < 1 

and 

(183b) (Vn G N) d(a n +\,b n ) < a.d(a n ,b n ) and d(a n +\,b n+ \) < Bd{a n+ \,b n ). 

Then (Vn G N) d(a n+ \, b n+ \) < jd(a n , b„) and there exists c G A n B suc/z £/zaf 

1 + a 

(184) (Vn G N) max {d(fl W/ c),d(b n ,c)} < - — -d(a , b ) • 7"; 
consequently, (a n )neN (b n )neN converge linearly to c with rate 7. 

Proof. Set S := d(ao,bo). Then for every n G N, 

(185) d(a„,b n ) < 8d(a n ,b n -i) < xBd(a n -i,b n -i) = ifd(a n -\,b n -\) <■■■ < 7"^; 
hence, 

(186a) d(b n , b n+1 ) < d{b n , a n+1 ) + d(a n+1/ b n+1 ) < ad(b n , a n ) + yd(a n , b„) 

(186b) = (a + j)d(a n , b n ) <(cc + j)5j n . 
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Thus (b„) ne fj is a Cauchy sequence, so there exists c G B such that b„ — > c. On the other hand, by 
(185), d(a„,b„) —> and (fl K ) n elN nes in -A- Hence, «„ — >■ c and c£ A Thus, c G A n B. Fix n G N 
and let m > n. Using (186), 



m— 1 

(187) </(&„,&„) < E d(h,b k +i) < £d(b k) b k+1 ) < £ (a + 7)^7 

A:=« fc>n fc>n 

Hence, using (185) and (187), we estimate that 

(188) d{a n , b m ) < d(a n , b„) + d(b n , b m ) < Sj 
Letting m -> +oo in (187) and (188), we obtain (184). 



k _ (ft + 7^7" 
1-7 ' 



„ (a + 7)^7" _ (l + a)^7 n 



1-7 



1-7 



The sequence generated by the MAP 

We start with the following definition, which is well defined by Proposition 2.2. 
Definition 10.9 (MAP) Let A and B be nonempty closed subsets ofX, let b-\ G X, and let 
(189) (VhgN) a n e P A {K-\) andb n G P B («»)- 

Then we say that the sequences (a n ) n ^ an d (fci,)neN are generated by the method of alternating 
projections (with respect to the pair (A, B)) with starting point b-\. 



The MAP between 
A = A x U A 2 and B, 
A n B = {ci,c 2 } 




Our aim is to provide sufficient conditions for linear convergence of the sequences generated 
by the method of alternating projections. The following two results are simple yet useful. 

Proposition 10.10 Let A and B be nonempty closed subsets of X, and let (a n ) and (b„) be sequences 
generated by the method of alternating projections. Then the following hold: 
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(i) The sequences (fl„) n eN ana " (b n )n&i lie in A and B, respectively. 

(ii) (Vn G N) ||a n+1 - fc M+ i|| < \\a n+1 - b n \\ < \\a n - b n \\. 

(iii) If {a n } n <3<i n B 7^ 0, or {fo„}„ G ]N flA / 0, ffren ffrere exists c £ AflB such that for all n 
sufficiently large, a„ = b„ = c. 

Proof, (i): This is clear from the definition. 

(ii) : Indeed, for every n G N, ||fl n +i ~~ &n+i|| = ^B( fl «+i) < || fl n+i — &n|| = d^(b„) < \\b n — a n \\ 
using (i). 

(iii) : Suppose, say that a„ G B. Then b„ = Pgfl„ = a n =: c <E A (1 B and all subsequent terms of 
the sequences are equal to c as well. ■ 

New convergence results for the MAP 

We are now in a position to state and derive new linear convergence results. In this section, we 
shall often assume the following: 





' A := (Ai)j e i and B := (B,-), € ; are nontrivial collections 




of nonempty closed subsets of X; 




A := (J A; and B := (J B ; are closed; 




ieZ ;e/ 




c G An B; 


(190) 


^4 := (A,-),- e j and £> := (B/)/e/ are collections 




of nonempty subsets of X such that 




(Vie I) P A ,.((bdryB)\A) C A,-, 




(V/G J) P B .((bdryA)\B) CB y ; 




A := [j Ai and B:=\J By. 







Lemma 10.11 (backtracking MAP) Assume i/zat (190) /joZrfs. Let (a n )neN (6 n )neN ^ e generated 
by the MAP with starting point b-\. Let n G {1, 2, 3, ... }. T/ien the following hold: 



(i) J/b„ £ A, f/zen a„ G ((bdry A) \ B) n U !€ j(A- n A,) and b„ G ((bdry B) \ A) n U/ej( B / n B /0- 

(ii) 7/a„ ^ B, then a n G ((bdry A) \ B) n U«ei(A' n A,). 
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(iii) If On t B and n > 2, then b n -\ G ((bdry B) \ A) n U/ € /( B ; n 

Proof, (i): Applying Proposition 10.5(iii) to b n -\ G B,a n G P^fc„_i, b„ G Ps^n, we obtain 

(191) b„ G B \ A o b„ G ((bdryB) \ A) n [_J(B ; - n B ; ) => a„ G A \ B. 

On the other hand, applying Proposition 10.5(iv) to a n -\ G A, b n -\ G Pgfl n _i, a„ G V A b n -\, we see 
that 

(192) a„ G A \ B O a„ G ((bdry A) \ B) D (J (A; D A,). 

16 J 

Altogether, (i) is established. 

(ii)&(iii): The proofs are analogous to that of (i). ■ 

Let us now state and prove a key technical result. 

Proposition 10.12 Assume that (190) holds. Suppose that there exist e > and S > such that the 
following hold: 



(i) A is (B, e,3S)-joint-regular at c (see Definition 9.6) and set 
(193) a :-- 



1, ifB is not known to be (A, e, 3S)-joint-regular at c; 

2, ifB is also (A, e, 3S)-joint-regular at c. 



(ii) 03,5 < 1 — 2e, w/zere ^ is the joint-CQ-number at c associated with (A, A, B, B) (see Definition 7.2). 

Set 9 := 9 3 $ + 2e G ]0,1[. Let (fl n )neN an d (b n )neK be sequences generated by the MAP with starting 
point b_i satisfying 

(194) l»_,-c||< {l ~^ S 



6(2 + 9 - &r) ' 

Then (fl n )neN and (fr n )neN converge linearly to some point c G A n B with rate 6 a ; in fact, 
(195) \\c-c\\<5 and (Vn > 1) max {\\a n - c\\, \\b n - c\\ } < ij^t^, ^"- 1 ). 

Proof. In view of «i G P A P B P A b_i and (194), Corollary 10.3 yields 
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Since c G A n B, we have 9$$ > by (98) and hence 6 > 0. Using (196), we estimate 

n-2 «-l 

(197a) (Vn > 1) B9 a{ - n ~^ + 6 + yS(l + 9) £ 0** < P + (6(1 + 0) I] 0"* 

*:=0 fc=0 

(197b) =/3 + / 3 ( i + 0)___ 

(197c) < 6 + 6±±± 

(197e) < 5. 



We now claim that if 

n-2 

(198) n>l, \\a n -b„\\ < B9^ n ~^ and \\a n - c\\ < B + B(l + 9) ^ 9 ak , 

k=0 

then 

(199a) ||a B+1 - b n+1 \\ < 9 a - 1 \\a n+1 - b n \\ < 9 a \\a n - b„\\ < B9 an , 

n-l 

(199b) K+i-c|| < B + 8(1 + 9) Y^9 ffk . 

To prove this claim, assume that (198) holds. Using (198) and (197), we first observe that 
(200a) max { \\a n — c\\, \\b n — c\\ } < \\b n — a n \\ + \\a n — c\\ 

n-2 

(200b) < 86^"-^ + B + B(1 + 6)J2 ^ < 



We now consider two cases: 



Case 1: b n E A (~) B. Then b n = a n+ \ = b n+ \ and thus (199a) holds. Moreover, ||fl„+i — c|| = 
\\b n - c\\ and (199b) follows from (200a). 

_ Case 2: b n G" A n B. Then b n E B \ A. Lemma 10.11(i) implies fl„ G ((bdry A) \ B) n U;ei(A- n 
A-) and G ((bdry B) \ A) n U/ej( B ; n B ;)- Note that IK - c\\ <S by (200a), and recall that A is 
(B,e,3<5) -joint-regular at c by (i). It thus follows from Proposition 10.5(vi) (applied to a n , b n ,a n+ i) 
that 

(201) ||fl n+ i - b n \\ < 6\\a n - b n \\. 

On the one hand, if a = 1, then Proposition lO.lO(ii) yields \\a n+ i — b n+ \\\ < \\a n+ \ — b n \\ = 
9 <T ~ 1 \\a n+ i — b n \\. On the other hand, if a = 2, then B is (A, e, 35) -joint-regular at c by (i); hence, 
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n-2 

,ak 



Proposition 10.5(v) (applied to b nr a n+ \,b n+ i) yields ||fl n +i — fen+i|| < #||fln+i — fen II = Q a 1 \Wn+i — 
fe n ||. Altogether, in either case, 

(202) \\a n+1 - fe„ +1 || < fl^K+i -b n \\. 
Combining (202) with (201) and (198) gives 

(203) |K+i - b n+1 \\ < e a - l \\a n+1 -b n \\< e a \\a n - b n \\ < pe m , 

which is (199a). Furthermore, (201), (198) and (200a) yield 

(204a) ||a b+1 - c\\ < \\a n+1 - b„\\ + \\b n - c\\ 

(204b) < 0\\a„ - b„\\ + \\b n - c\\ 

(204c) < 6^"-^ + BO^"^ + B + /3(1 + 6) £ & 

n-l 

(204d) = B + 0(1 + 0) £ 0°*, 

which establishes (199b). Therefore, in all cases, (199) holds. 

Since ||fli — bi\\ = ds(fli) < ||«i — c|j = B, we see that (198) holds for n = 1. Thus, the above 
claim and the principle of mathematical induction principle imply that (199) holds for every n > 1. 

Next, (199a) implies 

(205) (Vn > 1) ||fl„+i _ || < _ b n \\ an d ||fl«+i — fe«+i|| < a ~ 1 \\a n +i — b„\\. 
In view of (205) and ||fli — fei || < B, Proposition 10.8 yields c E A n B such that 

(206) (Vn>l) max { \\a n - c\\,\\b n - c\\] < ^^ll«i ~M • ^""^ 

(207) <l±l[i.0-(n-i) 

(208) < * (1 „ +e > r^- 1 ). 

v ' - 2 + 6 -6 a 

On the other hand, (199b) and (197) imply (Vn > 1) ||fl n +i _ c || < thus, letting n —> +oo, we 
obtain ||c — c|| < S. This completes the proof of (195). ■ 

Remark 10.13 In view of Lemma 10.1(i)&(ii), an aggressive choice for use in (190) is (Vz £ I) 
Ai = bdry A { and (V; <E /) Bj = bdry Bj. 



Our main convergence result on the linear convergence of the MAP is the following: 

Theorem 10.14 (linear convergence of the MAP and superregularity) Assume that (190) holds and 
that A is B-joint-superregular at c (see Definition 9.6). Denote the limiting joint-CQ-number at c associ- 
ated with (A, A, B, B) (see Definition 7.2) by 9, and the the exact joint-CQ-number at c associated with 
(A, A, B, B) (see Definition 7.7) by a. Assume further that one of the following holds: 
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(i) e < l. 



(ii) I and J are finite, and a. < 1. 

Let 9 G ] 9, 1 [ and set e := (6 — 8) /3 > 0. T/jen ffere exists 5 > suc/i f/wf the following hold: 

(iii) ^4 is (B,e,3S)-joint-regular at c (see Definition 9.6). 

(iv) 3( 5 < 9 + e < 1 — 2e, where 9^s is the joint-CQ-number at c associated with (A,A,B,B) (see 
Definition 7.2). 

Consequently, suppose the starting point of the MAP b-\ satisfies — c|| < (1 — 6)5/12. Then 
(fln)neN and (£> n )neN converge linearly to some point in c G ADB with \\c — c\\ < 5 and rate 9: 

(209) (Vn > 1) max{||fl„ - c\\, \\b„ - c\\} < + 

Proof. Observe that (ii) implies (i) by Theorem 7.8(iv). The definitions of B-joint-superregularity 
and of 9 allow us to find S > sufficiently small such that both (iii) and (iv) hold. The result thus 
follows from Proposition 10.12 with a = 1. ■ 

Corollary 10.15 Assume that (190) holds and that, for every i G I, A[ is convex. Denote the limiting 
joint-CQ-number at c associated with (A, A, B, B) (see Definition 7.2) by 9, and assume that 8 < 1. Let 
9 e ] 9, 1 [, and let b_ lr the starting point of the MAP, be sufficiently close to c. Then (fl n )neN an d (b n )n£N 
converge linearly to some point in ADB with rate 8. 

Proof. Combine Theorem 10.14 with Corollary 9.8. ■ 

Example 10.16 (working with collections and joint notions is useful) Consider the setting of 
Example 7.4, and suppose that A = A and B = B. Note that A, is convex, for every i G I. 
Then 9 3 {A, A,B,B) < 1 = 9 S (A,A,B,B) = 9(A,X,B,X). Hence Corollary 10.15 guarantees lin- 
ear convergence of the MAP while it is not possible to work directly with the unions A and B due 
to their condition number being equal to 1 and because neither A nor B is superregular by Exam- 
ple 9.9! This illustrates that the main result of Lewis-Luke-Malick (see Corollary 10.24 below) is 
not applicable because two of its hypotheses fail. 

The following result features an improved rate of convergence 6 2 due to the additional presence 
of superregularity. 

Theorem 10.17 (linear convergence of the MAP and double superregularity) Assume that (190) 
holds, that A is B-joint-superregular at c and that B is A-joint-superregular at c (see Definition 9.6). 
Denote the limiting joint-CQ-number at c associated with (A, A, B, B) (see Definition 7.2) by 9, and the 
the exact joint-CQ-number at c associated with (A,A,B,B) (see Definition 7.7) by a. Assume further 
that (a) 8 < 1, or (more restrictively) that (b) J and ] are finite, and a < 1 (and hence 9 = a. < 1). Let 
8 G ] 8, 1 [ and e := Then there exists 5 > such that 
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(i) A is (B,e,35)-joint-regular at c; 

(ii) B is (A, e, 3S)-joint-regular at c; and 

(iii) 633 < 8 + £ = — 2e < 1 — 2e, where 63s is the joint-CQ-number at c associated with (A, A, B, B) 
(see Definition 7.2). 

Consequently, suppose the starting point of MAP b-\ satisfies — c\\ < g^-e) • Then (fl n )neN an d 
(^n)neN converge linearly to some point in c G ADB with \\c — c|| < 8 and rate 8 2 ; in fact, 

(210) (Vn>l) max{|K-c||,||& n -c||} < ^(e 2 )"- 1 . 

Proof. The existence of £ > such that (i)-(iii) hold is clear. Then apply Proposition 10.12 with 
or = 2. U 

In passing, let us point out a sharper rate of convergence under sufficient conditions stronger 
than superregularity. 

Corollary 10.18 (refined convergence rate) Assume that (190) holds and that there exists S > such 
that 

(i) A is (B,0,35)-joint-regular at c; 

(ii) B is (A,0,3S)-joint-regular at c; and 

(iii) 6 < 1, where 6 := 6 3S is the joint-CQ-number at c associated with (A, A, B, B) (see Definition 7.2). 

Suppose also that the starting point of the MAP b-\ satisfies — c\\ < j^-ly Then (« n )neN an d 
(frjj)neN converge linearly to some point in c G ADB with \\c — c\\ < 5 and rate 8 2 ; in fact, 

(211) (Vn>l) m a x{\\a n -c\\,\\b n -c\\} <^(e 2 ) n -\ 

Proof. Apply Proposition 10.12 with a = 2. ■ 
Let us illustrate a situation where it is possible to make 5 in Theorem 10.17 precise. 

Example 10.19 (the MAP for two spheres) Let Z\ and Z2 be in X, let p\ and P2 be in R, set 

A := sphere (zi; pi) and B := sphere (2,2; pi), and assume that {c} ^ ADB ^ AUB. Then 
a := I (zi — c,Z2 — c) |/ {p\pi) < 1- Let G ]a, 1[. Then the conclusion of Theorem 10.17 holds 
with 

(212) (5 := min 



V (pi + P2) 2 + pip2{0 - a) - (pi + P2) cpi fP_2 
6 ' 3 ' 3 
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Proof. Combine Example 9.3 (applied with e = (6 — a)/4 there), Proposition 8.4, and Theo- 
rem 10.17. ■ 

Here is a useful special case of Theorem 10.17: 

Theorem 10.20 Assume that A and B are L-superregular, and that 
(213) N A (c)n(-N B (c))n(L-c) ={0}, 

where L := aff(A UB). Then the sequences generated by the MAP converge linearly to a point in ADB 
provided that the starting point is sufficiently close to c. 



Proof. Combine Example 8.2 with Theorem 10.17 (applied with I and / being singletons, and with 
A = B = L). ' ■ 

We now obtain a well known global linear convergence result for the convex case, which does 
not require the starting point to be sufficiently close to A n £>: 

Theorem 10.21 (two convex sets) Assume that A and B are convex, and A n B ^ 0. Then for every 
starting point b-\ G X, the sequences (fl n )neN and (fc n )neN generated by the MAP converge to some point 
in ADB. The convergence of these sequences is linear provided that ri A n ri B ^ 0. 



Proof. By Fact 2.5(iv), we have 

(214) (VceAnB) ||flo-c|| > ||feo-c|| > ||«i-c|| > ||&i-c|| > ■■■ 

After passing to subsequences if needed, we assume that a kn — > a G A and b kn — > b G B. We 
show that a = b by contradiction, so we assume that e := \\a — b\\/3 > 0. We have eventually 
max{||flfc n — a\\, \\b kn — b\\ } < e; hence \\a kn — b kn \\ > e eventually. By Fact 2.5(iii), we have 

(215) \\a kn -c\\ 2 > \\a kn - b kn \\ 2 + \\b kn - c\\ 2 > e 2 + \\a kn+1 - c\\ 2 > e 2 + \\a kn+1 - c\\ 2 

eventually. But this would imply that for all n sufficiently large, and for every m G N, we have 
W^kn — c\\ 2 > me 2 + \\a kn+m — c\\ 2 > me 2 , which is absurd. Hence c:=a = beADB and now (214) 
(with c = c) implies that a n —> c and b n —> c. 

Next, assume that ri A n ri B ^ 0, and set L := aff(A U B). By Proposition 8.5, the (A,L,B,L)- 
CQ conditions holds at c. Thus, by Example 8.2, N A (c) n (— Ng(c)) n (L — c) = {0}. Furthermore, 
Corollary 9.8 and Remark 9.2(vi)&(viii) imply that A and B are L-superregular at c. The conclusion 
now follows from Theorem 10.20, applied to suitably chosen tails of the sequences (fl n )neN an d 

Example 10.22 (the MAP for two linear subspaces) Assume that A and B are linear subspaces of 
X. Since G A D B = ri A n ri B, Theorem 10.21 guarantees the linear convergence of the MAP to 
some point in A H B, where b-\ G X is the arbitrary starting point. On the other hand, A and B 
are (0, +oo) -regular (see Remark 9.2(vi)). Since (W G R++) S (A,A,B,B) = c(A,B) < 1, where 
c(A, B) is the cosine of the Friedrichs angle between A and B (see Theorem 8.12), we obtain from 
Corollary 10.18 that the rate of convergence is c 2 (A, B). In fact, it is well known that this is the 
optimal rate, and also that lim„ a n = lim„ b n = Pac\b{^-i)', see [10, Section 3] and [11, Chapter 9]. 
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Remark 10.23 For further linear convergence results for the MAP in the convex setting we refer 
the reader to [1], [2], [3], [12], [13], [14], and the references therein. See also [20] and [21] for recent 
related work for the nonconvex case. 



Comparison to Lewis-Luke-Malick results and further examples 

The main result of Lewis, Luke, and Malick arises as a special case of Theorem 10.14: 

Corollary 10.24 (Lewis-Luke-Malick) (See [17, Theorem 5.16].) Suppose that N A (c) n (-N B (c)) = 
{0} and that A is superregular at c G AD B. If the starting point of MAP is sufficiently close to c, then 
the sequences generated by the MAP converge linearly to a point in ADB. 

Proof. Since N A (c) n (-N B (c)) = {0}, we have 9 < 1. Now apply Theorem 10.14(i) with A := 
B := (X), A := (A) and B := (B). ■ 

However, even in simple situations, Corollary 10.24 is not powerful enough to recover known 
convergence results. 

Example 10.25 (Lewis-Luke-Malick CQ may fail even for two subspaces) Suppose that A and B 
are two linear subspaces of X, and set L := aff (A U B) = A + B. For c G A n B, we have 

(216) N A (c) n (-Nb(c)) = A 1 n6 1 = (^ + B) x = L ± . 

Therefore, the Lewis-Luke-Malick CQ (see [17, Theorem 5.16] and also Corollary 10.24) holds for 
(A, B) at c if and only if 

(217) N A (c) n (-N B (c)) = {0} & A + B = X. 

On the other hand, the CQ provided in Theorem 10.20 (see also Example 10.22) always holds and we 
obtain linear convergence of the MAP. However, even for two lines in R 3 , the Lewis-Luke-Malick 
CQ (see Corollary 10.24) is unable to achieve this. (It was this example that originally motivated 
us to pursue the present work.) 

Example 10.26 (Lewis-Luke-Malick CQ is too strong even for convex sets) Assume that A and 
B are convex (and hence superregular). Then the Lewis-Luke-Malick CQ condition is G 
int(B - A) (see Corollary (i)) while the (A, aft (A U B), B, aft (A U B))-CQ is equivalent to the much 
less restrictive condition ri A n ri B ^ (see Theorem 4.13). 

The flexibility of choosing (A, B) 

Often, L = aff(A U B) is a convenient choice which yields linear convergence of the MAP as in 
Theorem 10.20. However, there are situations when this choice for A and B is not helpful but 
when a different, more aggressive, choice does guarantee linear convergence: 
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Example 10.27 ((A, B) = (A, £>)) Let A, B, and c be as in Example 8.3, and let L := aff(A U B). 
Since A and B are conwex and hence superregular, the (A, L, B, L)-CQ condition is equivalent to 
ri A n ri B ^ (see Proposition 8.5), which fails in this case. However, the (A, A, B, B)-CQ con- 
dition does hold; hence, the corresponding limiting CQ-number is less than 1 by Theorem 7.8(v). 
Thus linear convergence of the MAP is guaranteed by Theorem 10.17. 

The next example illustrates a situation where the choice (A, B) = (A, B) fails while the even 
tighter choice (A, B) = (bdry A,bdryB) results in success: 

Example 10.28 ((A, B) = (bdry A, bdry B)) Suppose that X = R 2 , that A = epi(| • \/2), that B = 
- epi( | • | /3), and that c = (0, 0) . Note that aff (A U B) = X and ri A n ri B = 0. Then 



(218a) N|(c) = N|(c) = N A (c) = {(ui,u 2 ) G K 2 | u 2 + 2\u x \ < 0}, 

(218b) N£(c) = Nf (c) = N B (c) = {(ui,u 2 ) G K 2 | -« 2 + 3|«i| < 0}, 

and so the (A, A, B, B)-CQ condition fails because 

(219) Nl(c) n {-N£(c)) = {(wi,u 2 ) G K 2 | w 2 + 3|«i| < 0} ^ {0}. 



Consequently, for either (A, B) = (A, B) or (A, B) = (X, X), Theorem 10.17 is not applicable 
because 1 = 9 = 1: indeed, u = (0, —1) G Na(c) and v = (0, —1) G — Ng(c), so 1 = (w,z>) < a < 
1. 

On the other hand, let us now choose (A, B) = (bdry A, bdry B), which is justified by Re- 
mark 10.13. Then 

(220a) N\{c) = {(u x ,u 2 ) G K 2 | U2 + 2\u x \ = 0}, 

(220b) N$(c) = {(ui,u 2 ) GR 2 I -u 2 + 3\u 1 \ = 0}, 

N|(c) n (-N§(c)) = {0} and the (A,A,B,B)-CQ condition holds. Hence, using also Theo- 
rem 7.8(v), Theorem 10.21 and Theorem 10.17, we deduce linear convergence of the MAP. 

However, even the choice (A, B) = (bdry A, bdry B) may not be applicable to yield the desired 
linear convergence as the following shows. In this example, we employ the tightest possibility 
allowed by our framework, namely (A, B) = (P^((bdryB) \ A),Pg((bdry A) \ B)). 

Example 10.29 ((A, B) = (P A ((bdry B) \ A),P B ((bdry A) \ B)) ) Suppose that X = K 2 , that A = 

epifl • |), that B = -A,andthatc = (0,0). Then N^ dryB (c) = bdryB = — bdry A and N^ dryA (c) = 

bdry A; hence, the (A, bdry A, B,bdry B)-CQ condition fails because ryB (c) n (-Ng dryA (c)^ = 

bdryB ^ {0}. On the other hand, if (A, B) = (P A ((bdryB) \ A),P B ((bdry A) \ B)), then n| = 

{0} = Ng 4 = {0} because A = {c} = B. Thus, the (A, A, B, B)-CQ conditions holds. (Note that 
the MAP converges in finitely many steps.) 
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Conclusion 



We have introduced restricted normal cones which generalize classical normal cones. We have 
presented some of their basic properties and shown their usefulness in describing interiority con- 
ditions, constraint qualifications, and regularities. The corresponding results were employed to 
yield new powerful sufficient conditions for linear convergence of the sequences generated by the 
method of alternating projections applied to two sets A and B. A key ingredient were suitable 
restricting sets (A and £>). The least aggressive choice, (A, B) = (X, X), recovers the framework 
by Lewis, Luke, and Malick. The choice (A,B) = (aff(A U £>),aff(A U £>)) allows us to include 
basic settings from convex analysis into our framework. Thus, the framework provided here 
unifies the recent nonconvex results by Lewis, Luke, and Malick with classical convex-analytical 
settings. When the choice (A, B) = (aff(A U £>),aff(A U £>)) fails, one may also try more aggres- 
sive choices such as (A, B) = (A, B) or (A, B) = (bdry A, bdry B) to guarantee linear convergence. 
In a follow-up work [5] we demonstrate the power of these tools with the important problem of 
sparsity optimization with affine constraints. Without any assumptions on the regularity of the 
sets or the intersection we achieve local convergence results, with rates and radii of convergence, 
where all other sufficient conditions, particularly those of [18] and [17], fail. 
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